
DASH: Searching Compressed DNA
Creating and Searching Compact Databases for The Genomic Era
Versandkostenfrei!
Versandfertig in 6-10 Tagen
52,99 €
inkl. MwSt.
PAYBACK Punkte
26 °P sammeln!
The advent of low-cost mass-sequencing of genomespresents significant data management difficulties.These will grow worse as it becomes routine tosequence the genomes of individual people andorganisms, because existing systems store and searcheach genome separately. This approach is notfeasible for searching andcomparing the genomes of millions or billions ofindividual organisms. This book seeks to solve thisproblem by describing the DASH sequence alignment andcompression algorithms. DASHmakes use of the overwhelming similarities amongstgenomes of a given species in order to compress, notonly t...
The advent of low-cost mass-sequencing of genomes
presents significant data management difficulties.
These will grow worse as it becomes routine to
sequence the genomes of individual people and
organisms, because existing systems store and search
each genome separately. This approach is not
feasible for searching and
comparing the genomes of millions or billions of
individual organisms. This book seeks to solve this
problem by describing the DASH sequence alignment and
compression algorithms. DASH
makes use of the overwhelming similarities amongst
genomes of a given species in order to compress, not
only the database size, but also the index size and
search time. The resulting novel approach to
database compression, index compression,
bioinformatics and information-retrieval should be of
especial interest to anyone who has an interest in
the storage and efficient searching of large data
sets, whether DNA or any other subject which offers
some degree of redundancy, such as natural language
text or web pages.
presents significant data management difficulties.
These will grow worse as it becomes routine to
sequence the genomes of individual people and
organisms, because existing systems store and search
each genome separately. This approach is not
feasible for searching and
comparing the genomes of millions or billions of
individual organisms. This book seeks to solve this
problem by describing the DASH sequence alignment and
compression algorithms. DASH
makes use of the overwhelming similarities amongst
genomes of a given species in order to compress, not
only the database size, but also the index size and
search time. The resulting novel approach to
database compression, index compression,
bioinformatics and information-retrieval should be of
especial interest to anyone who has an interest in
the storage and efficient searching of large data
sets, whether DNA or any other subject which offers
some degree of redundancy, such as natural language
text or web pages.