/doc/sphinx_doc/build/text/tuto.txt - Diff - NucleoMiner - Forge du Centre Blaise Pascal

Révision 6e0010bc doc/sphinx_doc/build/text/tuto.txt

     the 53 samples is indentify by a uniq identifier. The file
     *CSV_SAMPLE_FILE* sums up this information.
     configurator.CSV_SAMPLE_FILE = None
        Path to cvs file that contains sample information.
     We use a convention to link sample and Illumina fastq outputs.
     Illumina output files of the sample *ID* will be stored in the
     directory *ILLUMINA_OUTPUTFILE_PREFIX* + *ID*. For example, sample 41
     outputs will be stored in the directory
     *data/2012-09-05/FASTQ/Sample_Yvert_Bq41/*.
     configurator.ILLUMINA_OUTPUTFILE_PREFIX = None
        Prefix for Illumina fastq output files.
     For BY (resp. RM and YJM) we use following reference genome
     *saccharomyces_cerevisiae_BY_S288c_chromosomes.fasta* (resp.
     *saccharomyces_cerevisiae_rm11-1a_1_supercontigs.fasta* and
     *saccharomyces_cerevisiae_YJM_789_screencontig.fasta*). The index
     *FASTA_REFERENCE_GENOME_FILES* stores this information.
     configurator.FASTA_REFERENCE_GENOME_FILES = None
        Dictionary where each fasta reference genomes is indexed by
        reference strain that it corresponds.
     Each chromosome/contig is identify in the fasta file by an obscure
     identifier. For example, BY chromosome I is identify by
     *gi|144228165|ref|NC_001133.7|* when TemplateFilter is waiting for an
     integer. So, we translate it. The index *FASTA_INDEXES* stores this
     translation.
     configurator.FASTA_INDEXES = None
        Dictionary of strain that indexes dictionaries where keys are
        chromosome reference from Fastq file and value are its
        correspondance for Templatefilter.
     From a pragamatical point of view we discard some part of the genome
     (repeated sequence etc...). The list of the black listed area is
     explicitely detailled in *AREA_BLACK_LIST*.
     configurator.AREA_BLACK_LIST = None
        Dictionary where keys are strain and values are black listed of
        geneome region.
     For BY-RM (resp. BY-YJM and RM-YJM) genome sequence alignment we use
     previously compute .c2c file
     *data/2012-03_primarydata/BY_RM_gxcomp.c2c* (resp.
-...
     *NucleoMiner*, the old version of *NucleoMiner2* (http://www.ens-
     lyon.fr/LBMC/gisv/NucleoMiner_Manual/manual.pdf).
     configurator.C2C_FILES = None
        Dictionary where each strain combination indexes genome aligment.
     *nucleominer* uses specific directory to work in, these are described
     in *INDEX_DIR*, *ALIGN_DIR* and *LOG_DIR*.

Formats disponibles : Unified diff

LBMC » NucleoMiner

Révision 6e0010bc doc/sphinx_doc/build/text/tuto.txt