Article Details : |
| | Article Name : | | Analysis of Compression Techniques for DNA
Sequence Data | Author Name : | | SHAKEELA BIBI
JAVED IQBAL
ADNAN IFTEKHAR
MIR HASSAN | Publisher : | | Bridge Center | Article URL : | | | Abstract : | | Biological data mainly comprises of Deoxyribonucleic acid
(DNA) and protein sequences. These are the biomolecules which are
present in all cells of human beings. Due to the self-replicating
property of DNA, it is a key constitute of genetic material that exist in
all breathing creatures. This biomolecule (DNA) comprehends the
genetic material obligatory for the operational and expansion of all
personified lives. To save DNA data of single person we require 10CDROMs. Moreover, this size is increasing constantly, and more and
more sequences are adding in the public databases. This abundant
increase in the sequence data arise challenges in the precise
information extraction from this data. Since many data analyzing and
visualization tools do not support processing of this huge amount of
data. To reduce the size of DNA and protein sequence, many scientists
introduced various types of sequence compression algorithms such as
compress or gzip, Context Tree Weighting (CTW), Lampel ziv Welch
(LZW), arithmetic coding, run length encoding and substitution
method etc. These techniques have sufficiently contributed to minimize
the volume of the biological datasets. On the other hand, the
traditional compression techniques are also not much suitable for the
compression of these types of sequential data. In this paper, we have
explored diverse types of techniques for compression of large amount of
DNA Sequence data. In this paper, the analysis of techniques reveals
that efficient techniques not only reduce the size of the sequence but also avoid from any information loss. The review of existing studies
also shows that compression of DNA sequence is significant for
understanding the critical characteristics of DNA data in addition to
improving storage efficiency and data transmission. In addition, the
compression of protein sequence is a challenge for the research
community. The major parameters for evaluation of these compression
algorithms includes compression ratio, running time complexity etc. | Keywords : | | Huffman Coding, Extended ASCII Illustration, DNA
Encryption, Decompression |
|
|
|
|
|
|
Announcements
|
- Volume 9, issue 5, August issue - publishing in progress
- Volume 9, Issue 2/ May 2021 publishing IN PROGRESS
- Call for Papers: open. Articles will be uploaded constantly, once the evaluation process is complete.
- The Certificates of Publication can be downloaded from RECEIVED ARTICLES section.
- COVER - VOLUME 2, ISSUE 12 / MARCH 2015
- COVER - VOLUME 3, ISSUE 2 / MAY 2015
- COVER - VOLUME 3, ISSUE 3 / JUNE 2015
- COVER - VOLUME 3, ISSUE 4 / JULY 2015
- COVER - VOLUME 3, ISSUE 8/ NOVEMBER 2015
- COVER - VOLUME 4, ISSUE 4 / JULY 2016
- COVER - VOLUME 4, ISSUE 5 / AUGUST 2016
- COVER - VOLUME 7, ISSUE 11/ FEBRUARY 2020
- Volume 9, issue 3, June 2021 publishing in progress
|
|
New Launched Project |
|
|
Recommend & Share
|
|