Difference between revisions of "Preparing reference sequence"

From GenomeView Manual
Jump to navigation Jump to search
 
Line 1: Line 1:
 
To be able to easily handle large reference genomes, it is required that they are indexed. This can be done with the <em>faidx</em> command from the samtools package.
 
To be able to easily handle large reference genomes, it is required that they are indexed. This can be done with the <em>faidx</em> command from the samtools package.
  
If you are also preparing HTS data sets in the BAM format, this step will also be part of that procedure, so either you move right to the [[BAM|short read preparation page]] or you can skip the step there whenever you're ready.
+
If you are also preparing HTS data sets in the BAM format, this step will also be part of that procedure, so either you move right to the [[Preparing_read_data|short read preparation page]] or you can skip the step there whenever you're ready.
  
 
To index a fasta file you run  
 
To index a fasta file you run  
Line 10: Line 10:
 
If your file was called reference.fasta, GenomeView will search for reference.fasta.fai in the same directory. If you want to be able to load large files, make sure those two files are correctly named and in the same folder.
 
If your file was called reference.fasta, GenomeView will search for reference.fasta.fai in the same directory. If you want to be able to load large files, make sure those two files are correctly named and in the same folder.
  
You can <a href="http://samtools.sourceforge.net/">download the samtools package from Sourceforge</a>.
+
You can [http://samtools.sourceforge.net download the samtools package from Sourceforge].

Latest revision as of 20:01, 18 November 2013

To be able to easily handle large reference genomes, it is required that they are indexed. This can be done with the faidx command from the samtools package.

If you are also preparing HTS data sets in the BAM format, this step will also be part of that procedure, so either you move right to the short read preparation page or you can skip the step there whenever you're ready.

To index a fasta file you run

samtools faidx reference.fasta


Attention

If your file was called reference.fasta, GenomeView will search for reference.fasta.fai in the same directory. If you want to be able to load large files, make sure those two files are correctly named and in the same folder.

You can download the samtools package from Sourceforge.