Detecting genomic indel variants with exact breakpoints in single- and paired-end sequencing data using SplazerS. Emde, A., Schulz, M. H., Weese, D., Sun, R., Vingron, M., Kalscheuer, V. M., Haas, S. A., & Reinert, K. Bioinformatics (Oxford, England), 28(5):619–627, March, 2012. doi abstract bibtex MOTIVATION: The reliable detection of genomic variation in resequencing data is still a major challenge, especially for variants larger than a few base pairs. Sequencing reads crossing boundaries of structural variation carry the potential for their identification, but are difficult to map. RESULTS: Here we present a method for 'split' read mapping, where prefix and suffix match of a read may be interrupted by a longer gap in the read-to-reference alignment. We use this method to accurately detect medium-sized insertions and long deletions with precise breakpoints in genomic resequencing data. Compared with alternative split mapping methods, SplazerS significantly improves sensitivity for detecting large indel events, especially in variant-rich regions. Our method is robust in the presence of sequencing errors as well as alignment errors due to genomic mutations/divergence, and can be used on reads of variable lengths. Our analysis shows that SplazerS is a versatile tool applicable to unanchored or single-end as well as anchored paired-end reads. In addition, application of SplazerS to targeted resequencing data led to the interesting discovery of a complete, possibly functional gene retrocopy variant. AVAILABILITY: SplazerS is available from http://www.seqan.de/projects/ splazers. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
@article{emde_detecting_2012,
title = {Detecting genomic indel variants with exact breakpoints in single- and paired-end sequencing data using {SplazerS}},
volume = {28},
issn = {1367-4811},
doi = {10.1093/bioinformatics/bts019},
abstract = {MOTIVATION: The reliable detection of genomic variation in resequencing data is still a major challenge, especially for variants larger than a few base pairs. Sequencing reads crossing boundaries of structural variation carry the potential for their identification, but are difficult to map.
RESULTS: Here we present a method for 'split' read mapping, where prefix and suffix match of a read may be interrupted by a longer gap in the read-to-reference alignment. We use this method to accurately detect medium-sized insertions and long deletions with precise breakpoints in genomic resequencing data. Compared with alternative split mapping methods, SplazerS significantly improves sensitivity for detecting large indel events, especially in variant-rich regions. Our method is robust in the presence of sequencing errors as well as alignment errors due to genomic mutations/divergence, and can be used on reads of variable lengths. Our analysis shows that SplazerS is a versatile tool applicable to unanchored or single-end as well as anchored paired-end reads. In addition, application of SplazerS to targeted resequencing data led to the interesting discovery of a complete, possibly functional gene retrocopy variant.
AVAILABILITY: SplazerS is available from http://www.seqan.de/projects/ splazers.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.},
language = {eng},
number = {5},
journal = {Bioinformatics (Oxford, England)},
author = {Emde, Anne-Katrin and Schulz, Marcel H. and Weese, David and Sun, Ruping and Vingron, Martin and Kalscheuer, Vera M. and Haas, Stefan A. and Reinert, Knut},
month = mar,
year = {2012},
pmid = {22238266},
keywords = {Algorithms, Genomics, Humans, INDEL Mutation, Sequence Analysis, DNA},
pages = {619--627},
file = {Volltext:/Users/mschulz/Zotero/storage/45JYHFG2/Emde et al. - 2012 - Detecting genomic indel variants with exact breakp.pdf:application/pdf},
}
Downloads: 0
{"_id":"RGiqcCPQ3ExwkzfZb","bibbaseid":"emde-schulz-weese-sun-vingron-kalscheuer-haas-reinert-detectinggenomicindelvariantswithexactbreakpointsinsingleandpairedendsequencingdatausingsplazers-2012","downloads":0,"creationDate":"2018-04-07T06:42:02.502Z","title":"Detecting genomic indel variants with exact breakpoints in single- and paired-end sequencing data using SplazerS","author_short":["Emde, A.","Schulz, M. H.","Weese, D.","Sun, R.","Vingron, M.","Kalscheuer, V. M.","Haas, S. A.","Reinert, K."],"year":2012,"bibtype":"article","biburl":"https://raw.githubusercontent.com/SchulzLab/MakeTitlesGreatAgain/master/Publications.bib","bibdata":{"bibtype":"article","type":"article","title":"Detecting genomic indel variants with exact breakpoints in single- and paired-end sequencing data using SplazerS","volume":"28","issn":"1367-4811","doi":"10.1093/bioinformatics/bts019","abstract":"MOTIVATION: The reliable detection of genomic variation in resequencing data is still a major challenge, especially for variants larger than a few base pairs. Sequencing reads crossing boundaries of structural variation carry the potential for their identification, but are difficult to map. RESULTS: Here we present a method for 'split' read mapping, where prefix and suffix match of a read may be interrupted by a longer gap in the read-to-reference alignment. We use this method to accurately detect medium-sized insertions and long deletions with precise breakpoints in genomic resequencing data. Compared with alternative split mapping methods, SplazerS significantly improves sensitivity for detecting large indel events, especially in variant-rich regions. Our method is robust in the presence of sequencing errors as well as alignment errors due to genomic mutations/divergence, and can be used on reads of variable lengths. Our analysis shows that SplazerS is a versatile tool applicable to unanchored or single-end as well as anchored paired-end reads. In addition, application of SplazerS to targeted resequencing data led to the interesting discovery of a complete, possibly functional gene retrocopy variant. AVAILABILITY: SplazerS is available from http://www.seqan.de/projects/ splazers. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.","language":"eng","number":"5","journal":"Bioinformatics (Oxford, England)","author":[{"propositions":[],"lastnames":["Emde"],"firstnames":["Anne-Katrin"],"suffixes":[]},{"propositions":[],"lastnames":["Schulz"],"firstnames":["Marcel","H."],"suffixes":[]},{"propositions":[],"lastnames":["Weese"],"firstnames":["David"],"suffixes":[]},{"propositions":[],"lastnames":["Sun"],"firstnames":["Ruping"],"suffixes":[]},{"propositions":[],"lastnames":["Vingron"],"firstnames":["Martin"],"suffixes":[]},{"propositions":[],"lastnames":["Kalscheuer"],"firstnames":["Vera","M."],"suffixes":[]},{"propositions":[],"lastnames":["Haas"],"firstnames":["Stefan","A."],"suffixes":[]},{"propositions":[],"lastnames":["Reinert"],"firstnames":["Knut"],"suffixes":[]}],"month":"March","year":"2012","pmid":"22238266","keywords":"Algorithms, Genomics, Humans, INDEL Mutation, Sequence Analysis, DNA","pages":"619–627","file":"Volltext:/Users/mschulz/Zotero/storage/45JYHFG2/Emde et al. - 2012 - Detecting genomic indel variants with exact breakp.pdf:application/pdf","bibtex":"@article{emde_detecting_2012,\n\ttitle = {Detecting genomic indel variants with exact breakpoints in single- and paired-end sequencing data using {SplazerS}},\n\tvolume = {28},\n\tissn = {1367-4811},\n\tdoi = {10.1093/bioinformatics/bts019},\n\tabstract = {MOTIVATION: The reliable detection of genomic variation in resequencing data is still a major challenge, especially for variants larger than a few base pairs. Sequencing reads crossing boundaries of structural variation carry the potential for their identification, but are difficult to map.\nRESULTS: Here we present a method for 'split' read mapping, where prefix and suffix match of a read may be interrupted by a longer gap in the read-to-reference alignment. We use this method to accurately detect medium-sized insertions and long deletions with precise breakpoints in genomic resequencing data. Compared with alternative split mapping methods, SplazerS significantly improves sensitivity for detecting large indel events, especially in variant-rich regions. Our method is robust in the presence of sequencing errors as well as alignment errors due to genomic mutations/divergence, and can be used on reads of variable lengths. Our analysis shows that SplazerS is a versatile tool applicable to unanchored or single-end as well as anchored paired-end reads. In addition, application of SplazerS to targeted resequencing data led to the interesting discovery of a complete, possibly functional gene retrocopy variant.\nAVAILABILITY: SplazerS is available from http://www.seqan.de/projects/ splazers.\nSUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.},\n\tlanguage = {eng},\n\tnumber = {5},\n\tjournal = {Bioinformatics (Oxford, England)},\n\tauthor = {Emde, Anne-Katrin and Schulz, Marcel H. and Weese, David and Sun, Ruping and Vingron, Martin and Kalscheuer, Vera M. and Haas, Stefan A. and Reinert, Knut},\n\tmonth = mar,\n\tyear = {2012},\n\tpmid = {22238266},\n\tkeywords = {Algorithms, Genomics, Humans, INDEL Mutation, Sequence Analysis, DNA},\n\tpages = {619--627},\n\tfile = {Volltext:/Users/mschulz/Zotero/storage/45JYHFG2/Emde et al. - 2012 - Detecting genomic indel variants with exact breakp.pdf:application/pdf},\n}\n\n","author_short":["Emde, A.","Schulz, M. H.","Weese, D.","Sun, R.","Vingron, M.","Kalscheuer, V. M.","Haas, S. A.","Reinert, K."],"key":"emde_detecting_2012","id":"emde_detecting_2012","bibbaseid":"emde-schulz-weese-sun-vingron-kalscheuer-haas-reinert-detectinggenomicindelvariantswithexactbreakpointsinsingleandpairedendsequencingdatausingsplazers-2012","role":"author","urls":{},"keyword":["Algorithms","Genomics","Humans","INDEL Mutation","Sequence Analysis","DNA"],"metadata":{"authorlinks":{}},"downloads":0},"search_terms":["detecting","genomic","indel","variants","exact","breakpoints","single","paired","end","sequencing","data","using","splazers","emde","schulz","weese","sun","vingron","kalscheuer","haas","reinert"],"keywords":["algorithms","genomics","humans","indel mutation","sequence analysis","dna"],"authorIDs":[],"dataSources":["TEyqHfEXbZQMpLPQf","CX2cjnP6abPqfH9mY"]}