TEtrimmer: a tool to automate the manual curation of transposable elements
Jiangzhao Qian,
Hang Xue,
Shujun Ou,
Ludwig Mann,
Jessica Storer,
Lisa Fürtauer,
Tony Heitkam,
Mary C. Wildermuth,
Stefan Kusch () and
Ralph Panstruga ()
Additional contact information
Jiangzhao Qian: Worringerweg 1
Hang Xue: University of California
Shujun Ou: 592 Aronoff Laboratory, 318W 12th Avenue
Ludwig Mann: Worringerweg 3
Jessica Storer: Unit 3179
Lisa Fürtauer: Worringerweg 1
Tony Heitkam: Worringerweg 3
Mary C. Wildermuth: University of California
Stefan Kusch: Worringerweg 1
Ralph Panstruga: Worringerweg 1
Nature Communications, 2025, vol. 16, issue 1, 1-20
Abstract:
Abstract Transposable elements (TEs) are repetitive DNA sequences that move within genomes and play important roles in gene regulation and genome evolution. Accurate TE annotation in genomes is crucial for downstream analyses but challenging due to their sequence diversity and frequent fragmentation, including the occurrence of nested copies. We here present TEtrimmer, a tool that automates and replaces key steps of traditional manual curation of TEs. TEtrimmer combines phylogenetic tree analysis with the machine learning method DBSCAN to cluster TE sequences accurately and applies a sliding-window strategy to remove poorly conserved regions of TE-derived multiple sequence alignments. TEtrimmer also provides detailed report plots and features a graphical user interface (GUI) application. Tested on the genomes of six organisms belonging to various kingdoms of eukaryotic life and three simulated genomes, TEtrimmer consistently improved the identification of intact TEs compared to the established tools EDTA and RepeatModeler2.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.nature.com/articles/s41467-025-63889-y Abstract (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-025-63889-y
Ordering information: This journal article can be ordered from
https://www.nature.com/ncomms/
DOI: 10.1038/s41467-025-63889-y
Access Statistics for this article
Nature Communications is currently edited by Nathalie Le Bot, Enda Bergin and Fiona Gillespie
More articles in Nature Communications from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().