**********************************************
* DATASETS USED IN WALLDEN ET AL, 2016, CELL *
**********************************************

* DnaQ-labeled strains, pooled dataset, data organised per molecule *

 This dataset can be used to re-create Figs. 3 and 5 in Wallden et al,
 2016.

 The output is stored in tab-separated text files using the names:
 DnaQ_pooled_per_molecule_XXX_YYY.txt, where XXX are the different
 growth conditions: slow, intermediate and fast. YYY is either data or
 header, where header-files contains a legend for the content of each
 column in data-files. The structure of the data files is further
 described below.

 The content of the data-files is organised as follows:
    V1 L1 W1 T1 R1 x1 I1
    V2 L2 W2 T2 R2 x2 I2
              .
	      .
	      .
    VN LN WN TN RN xN IN

  Each line represents a detected fluorescent dot localised inside a
  cell or, if no dot is detected within a detected cell, the line
  contains only the cell size parameters. V is the cell volume
  (micrometer cubed). L is the cell length (micrometers). W is the
  most common cell width after binning into M/sqrt(2) bins equally
  spaced between the maximum and mininum observed widths, where M is
  the number of observations of the width from the active contour
  model. T is the time (minutes) after last division. R is the growth
  rate (per minutes) for the entire cell cycle. The growth rate is
  estimated by regression of all the estimated volumes in an entire
  cell cycle.  x is the location (micrometers) of a detected
  fluorescent dot along the long axis of the cell and is zero in the
  middle of the cell. If no dot is detected x is set to NaN. I is how
  many dots that were detected for one specific cell at that given
  time. This implies that if for example I=2, two lines will be the
  same except for the position of the detected fluorescent dot. N is
  then the total number of observations.

* DnaQ-labeled strains, pooled dataset, data organised per cell cycle *

 This dataset together with the dataset in "DnaQ-labeled strains,
 pooled dataset, data organised per molecule" can be used to re-create
 Fig. 5C

 The output is stored in tab-separated text files using the names:
 DnaQ_pooled_per_cell_cycle_XXX_YYY.txt where XXX are the different
 growth conditions: slow, intermediate and fast. YYY is either data or
 header, where header-files contains information about the content of
 each column in data-files.

 The content of the data-files are as follows:
    R1 G1 VB1 VD1
    R2 G2 VB2 VD2
         .
	 .
	 .
    RN GN VBN VDN	

 Each line represent a single cell tracked from one division until
 next (from "birth" to "death"). R is the growth rate (per minutes)
 for the entire cell cycle. The growth rate is estimated by regression
 of all the estimated volumes in an entire cell cycle. VB is the
 volume right after division (the "birth" volume) (micrometers cubed),
 and VD is the volume right before following division (the "death"
 volume) (micrometers cubed). N is the total number of cell cycles.


* DnaQ-labeled strain, data organised per cell cycle *

 This dataset can be used to re-create Figs. 2, 6, and 7B

 The output is stored in tab-separated text files using the names:
 DnaQ_per_cell_cycle_XXX_YYY.txt where XXX are the different growth
 conditions: slow, intermediate and fast. YYY is either data or
 header, where header-files contains information about the content of
 each column in data-files.

 The output is stored in the same way as for "DnaQ-labeled strains,
 pooled dataset, data organised per cell cycle”.


* SeqA-labeled strain, data organised per cell cycle *

 This dataset can be used to re-create Figs. 4, 7 and 5D (slow growth)

 The output is stored in tab-separated text files using the names:
 SeqA_per_cell_cycle_XXX_YYY.txt where XXX are the different growth
 conditions: slow, intermediate and fast. YYY is either data or
 header, where header-files contains information about the content of
 each column in data-files.

 The content of the data-files are as follows:
    R1 G1 VB1 VD1 VI1 TI1
    R2 G2 VB2 VD2 VI2 TI2
         .
	 .
	 .
    RN GN VBN VDN VIN TIN

 Each line represent a single cell tracked from one division until
 next (from "birth" to "death"). R is the growth rate (per minutes)
 for the entire cell cycle. The growth rate is estimated by regression
 of all the estimated volumes in an entire cell cycle, G is the
 generation time (minute), VB the volume right after division (the
 “birth” volume) (micrometers cubed) and VD is the volume right before
 division (the “death” volume) (micrometers cubed). VI is the volume
 of initiation (micrometer cubed) and TI is the time from last
 division (“birth”) to initiation (minutes). N is the total number of
 cell cycles.

* SeqA-labeled strain, data organised by mother-daughter pairs of cell cycles *

 This dataset can be used to re-create Fig 5D (intermediate growth).

 The output is stored in tab-separated text files using the names:
 SeqA_mother_daugther_XXX_YYY.txt where XXX are the different growth
 conditions: slow, intermediate and fast. YYY is either data or
 header, where header-files contains information about the content of
 each column in data-files.

 The content of the data-files are as follows:
    RM1 RD1 GD1 CD1
    RM2 RD2 GD2 CD2
         .
	 .
	 .
    RMN RDN GDN CDN

 Each line represent a cell tracked for two generations, i.e. from
 birth in the "mother" cell to death in the "daughter" cell. RM and RD
 are the growth-rates (for the entire cell cycle) of the mother and
 the daughter respectively (per minute). GD is the generation time
 (minutes) of the daughter. CD is the C+D period (minutes) from the
 initiation event in the mother generation to division in the daughter
 generation.
