Reptile is a software developed in C++ for correcting sequencing errors in short reads from next-gen sequencing platforms. Reptile has several favorable properties:
Reptile has been developed by Xiao Yang, Karin Dorman and Srinivas Aluru.
The program is available under GNU Lesser General Public License version 3 with some components under Boost Software License version 1.0.
Note: the default values of program parameters are dataset dependent, i.e., they vary as dataset changes and hence are not “fixed” or “standard”. The calculation of these parameters can be automated but currently, many of them need to be set manually using the method explained in the paper (there is no assumption of any information of the reference genome). In general, the default parameters are chosen based on the histograms of quality scores, tile occurrences, and so on, of the dataset under consideration.
Here is a brief tutorial demonstrating how to use Reptile, including data preparation, parameter tuning, error correction, and generation of corrected short reads.
When using Reptile please cite:
X. Yang, K. Dorman and S. Aluru, “Reptile: Representative tiling for short read error correction”, Bioinformatics, 26(20), 2526-2533, 2010.