NDLI: Repseek, a tool to retrieve approximate repeats from large DNA sequences


Content Provider	Semantic Scholar
Author	Achaz, Guillaume; Boyer, Frédéric; Viari, Alain; Coissac, Eric
Copyright Year	2008
Abstract	The importance of genome redundancy has been strongly emphasized in the field of genome dynamics and evolution as well as in medical biology. A repeat is a sequence present twice or more with a high degree of similarity within a larger sequence (e.g. a chromosome) or set of sequences (e.g. a genome with several chromosomes). Each instance of the repeated sub-sequence is called a "copy" of the repeat. We use the term "duplication" to denote any active mechanistic event that creates a repeat. Even though spurious duplication events (or recombination events between repeats) can cause severe disorders [26, 24], repeated elements nonetheless remain a very important driving force of genome evolution [28]. In that respect, the dynamics and the evolution of these redundant sequences have been studied in bacterial genomes [31, 32, 5] as well as in eukaryotic genomes [3, 4, 38]. Duplication events can sometimes copy entire coding regions, giving birth to what are often referred to as duplicate genes. Those duplicate genes are the raw material leading to the emergence of novel functions and have been extensively studied (for a historical review see [37]). Although the repeats we are interested in encompass many known biological repeated elements (e.g. transposable elements, duplicated genes, DNA satellites, segmental duplications) our main concern is not to identify specific families of repeats, but to extract repeats on the sole basis of their sequence similarity and without any prior consideration of their biological function. Unlike RepeatMasker [34], we do not search for already well-characterized repeated elements. Furthermore, our primary goal is not to construct families of repeats. This is the objective of dedicated software such as RepeatScout [30] or of clustering algorithms [9, 29], which reconstruct families from pairs of repeats. Of course, our program can be used to feed these clustering algorithms.
While there are some widely accepted methods to detect duplicate genes in a genome (for instance based on BLAST or FASTA programs), there is no firmly established technique concerning the detection of repeats in large DNA sequences. The detection of repeats is not a trivial problem and there is no satisfactory methodology available apart from recursive local alignment (using dynamic programming) of sequences with themselves [41]. Such algorithms, however, are quadratic in computation time and in memory usage and
File Format	PDF HTM / HTML
Alternate Webpage(s)	http://www.researchgate.net/profile/Eric_Coissac/publication/6757777_Repseek_a_tool_to_retrieve_approximate_repeats_from_large_DNA_sequences/links/0deec51cc56ae91047000000.pdf
Alternate Webpage(s)	http://wwwabi.snv.jussieu.fr/public/RepSeek/Repseek_doc.pdf
Language	English
Access Restriction	Open
Content Type	Text
Resource Type	Article
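
The abstract notes that the baseline approach, local alignment of a sequence with itself by dynamic programming, is quadratic in both time and memory. The sketch below illustrates that baseline only; it is not Repseek's method. It is a single-pass Smith-Waterman-style self-alignment (not the recursive variant the abstract cites), and the function name and scoring values (`match`, `mismatch`, `gap`) are assumptions chosen for illustration.

```python
def best_self_local_alignment(seq, match=1, mismatch=-1, gap=-2):
    """Return (score, end_i, end_j) for the best local alignment of seq
    against itself, using 1-based end positions.

    Only cells with i < j are filled, so the trivial alignment of the
    sequence with itself along the main diagonal is excluded.
    """
    n = len(seq)
    # Full (n+1) x (n+1) score matrix: this is the quadratic memory cost.
    H = [[0] * (n + 1) for _ in range(n + 1)]
    best = (0, 0, 0)
    # Two nested loops over positions: this is the quadratic time cost.
    for i in range(1, n + 1):
        for j in range(i + 1, n + 1):
            s = match if seq[i - 1] == seq[j - 1] else mismatch
            score = max(
                0,                    # start a new local alignment
                H[i - 1][j - 1] + s,  # extend with a match or mismatch
                H[i - 1][j] + gap,    # gap in the second copy
                H[i][j - 1] + gap,    # gap in the first copy
            )
            H[i][j] = score
            if score > best[0]:
                best = (score, i, j)
    return best


if __name__ == "__main__":
    # "ACGT" occurs at positions 1-4 and 7-10 of this toy sequence.
    print(best_self_local_alignment("ACGTTTACGT"))
```

For a sequence of length n this fills on the order of n²/2 cells and allocates an n × n matrix, which is the cost the abstract describes as impractical for large DNA sequences.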
