20-04-2011, 03:58 PM
Bioinformatics - 2.doc (Size: 78 KB / Downloads: 85)
ABSTRACT
Bioinformatics is discipline which combines computer science, Biology, Information Technology. Bioinformatics used for storing, analysing, collecting large amount of biological information. Biological data include any type data related to plant, animal, yeast, fungi etc.
In this paper we represent how to determine genes in inherited diseases and to determine gene code for beneficial traits. Tools are used to find information about the function and location of particular genes. It gives the information of different types of tools used. Database is used to store the information and retrieve the information. Different types of databases are there depending upon there nature of biological data. Database plays very important role during data analysis. Bioinformatics uses the perl language for some projects like genome project. Bioinformatics has greatly revolutaionalized biological field.
1. INTRODUCTION
New discoveries are being made in the field of genomics, an area of study which looks at the DNA sequence of an organism in order to determine which genes code for beneficial traits and which genes are involved in the inherited diseases. With an increasing amount of information generated there is a need to have some techniques for storing and analyzing that information. Computers can really help in this process. As a result, a new research area that combines the study of biotechnology and use of computers is emerging. This field is referred to as bioinformatics. It involves advance computational methods to assist in storing and analyzing data generated from DNA sequencing, protein sequences etc.
Bioinformatics is also referred as the application of computational techniques to the management and analysis of biological information. It is an emerging discipline at the convergence of computing and life sciences aimed at development of technologies for storing, extracting, organizing, analyzing and interpreting information generated.
2. GOALS OF BIOINFORMATICS
Bioinformatics is the field of science in which biology, computer science and IT combines to form single discipline. The goal is to discover new biological insides and create worldwide perspective from which principles in biology can be found.
Bioinformatics has important sub-discipline such as
• Development of new algorithm and statistics to assess the relationship among the members of
large set of data sets.
• Analysis and interpretation of various types of data including nuclear type, amino acids sequences, protein domain, protein structure.
• Development and implementation of tools that enable efficient access and management of different types of information.
3. DATABASE
Databases organize, store, and disseminate files that contain information consisting of nucleic
Acid sequences, protein sequences and protein structures.
The databases are categorized on following two criterias
1. Depending on nature of information
2. On the manner of data storage
1. Database types depending on nature of data
Primary Databases
Composite Databases
Secondary Databases
2. Databases depending on data storage
Sequence Annotated Databases
Sequence Low Annotation Databases
Sequence Specialized Databases
Primary Sequence Databases
It is a database that stores bimolecular sequences and associated annotation information.
A) Nucleic Acid Sequence Databases
DNA are complicated 3D molecules composed of thousands or millions of atoms bonded together. DNA and proteins are polymers, chains of repeating chemical units i.e. monomers.In DNA four nucleic acid monomers (A T C and G) are commonly used to build polymer chain
Examples
• GenBank :- This is the DNA database incorporates data of DNA sequence collected globally by using publicly available resources. It contains approximately 15,850 million sequence records.
GenBank is divided into 3 sections
Header :-It contains information like sequence description, origin of organism, similar literature references. It also contains identifier at the beginning of a record.
Feature Table :- It contains description of the features in a record like coding sequences, repeats etc.
Sequence :- For analyzing it by computer more easily.
• DDBJ:-It is also called as DNA data bank Japan. It contains 50 million DNA sequences according to release 48.
• EMBL:- It is also called as European Molecular Biology Laboratory. As per release it contains 14.4 million DNA sequences.