《生物信息学》实验教学安排:.doc

上传人:创****公 文档编号:1840091 上传时间:2019-10-27 格式:DOC 页数:13 大小:68KB
返回 下载 相关 举报
《生物信息学》实验教学安排:.doc_第1页
第1页 / 共13页
《生物信息学》实验教学安排:.doc_第2页
第2页 / 共13页
点击查看更多>>
资源描述

《《生物信息学》实验教学安排:.doc》由会员分享,可在线阅读,更多相关《《生物信息学》实验教学安排:.doc(13页珍藏版)》请在得力文库 - 分享文档赚钱的网站上搜索。

1、Experiments in Bioinformatics (生物信息学实验指导)Experiments in Bioinformatics Edited by Longjiang FanExperiment 1. Construction of Genetic Maps Experiment 2. Analysis of DNA Sequence Experiment 3. Analysis of Protein Sequence Experiment 4. Multiple Sequence Alignments Experiment 5. Analysis of Gene Functio

2、n Last updated: 2002/1/16Experiments in Bioinformatics (生物信息学实验指导)Experiment 1. Construction of Genetic Maps( (2 学学时时) )THE PURPOSE About 400-1000 bases can been read in a sequencing run with the most modern sequencer and sequencing projects, such as the human genome project (HGP), have presented ma

3、ny challenges for sequence assembly because of the size and complexity of their genomes. So, construction of a landmark map of whole genome is essential for the accurate assembly of a special genome. Genetic (linkage) maps are those basic “landmark map”. In HGP, “four sheets of map” are its main tar

4、get, i.e. genetic map, physical map, sequence map and genes map. Furthermore, genetic maps are important for plant breeding, genetic disease research and the like. QTL (Quantitative Traits Loci) mapping is such efforts of construction of genetic maps. Many genes responsible for polygenic inheritance

5、 of particular characteristics are scattered around the genome. Their positions are known as quantitative traits loci (QTL). It is useful to know where they are for medical and agricultural reasons. In the case of animal and plant breeding it would be useful to identify young individuals with favora

6、ble alleles without waiting for their expression at maturity. The procedure in breeding situations is to take inbred lines that differ in the trait of interest, and also varies for markers typically variable number tandem repeats (VNTRs) at numerous probe sites. They are crossed and both the F2 prog

7、eny and later generations are examined for the desired trait and for the variations at the probe sites. If the presence of the trait correlates with inheritance of a particular marker allele, it is likely that one or more genes affecting the trait is located on the DNA close to that marker. The same

8、 procedure can be followed in human families, particularly for disease susceptibility loci, but is more complicated and difficult because the family sizes are smaller. The usual case is that one or two genes cause most of the variation, and there are increasingly more genes with smaller effects. Gen

9、es that contribute 5% or less to the variation in a trait are very difficult to find.In this experiment you will understand the steps to map QTL and use QTL mapping programs based on different methods, i.e. interval mapping (IM), composite interval mapping (CIM) and mixed-model based composite inter

10、val mapping (MCIM) to finish QTL map based on relative experimental data.LIST OF MATERIALS AND TOOLS Materials: Two set experiment data (D:): Mletest.mcd (for WinQTLCart) or Mletest.map and Mletest.txt (for QTLMapper); riceQTL.map and riceQTL.txt (for QTLMapper) Tools:Windows QTL Cartographer (WinQT

11、LCart) Version 1.30 by Shengchu Wang, Experiments in Bioinformatics (生物信息学实验指导)C. J. Baston and Z. B. Zeng or MAPMAKER/QTL Version 1.1 by Stephen E. Lincoln, Mark J. Daly and Eric S. Lander; QTLMapper Version 1.0 by Daolong Wang, Jun Zhu, et al.PROCEDURE Download and setup the software (WinQTLCart a

12、nd QTLMapper), data files and this guide The files list for this experiment: WinQTLCart /QTLMapper /Bioinformaticsguide.doc/ riceQTL.map and riceQTL.txt /Mletest.map and Mletest.txt Download those file from Bioinplant Lab page ( QTLs based on IM and CIM with WinQTLCart Step1. Start by inputing or cr

13、eating source data and verify the data Open and verify Mletest.mcd file. Step2. View and modify the source data Recognize its data body Step3. Select “Interval mapping” item in METHOD menu for the source data analysis The result file mletest-I.qrt is created. Step4. Select “Composite Interval mappin

14、g” item in METHOD menu for the source data analysis The result file mletest-C.qrt is created. Step5. View the mapping results in graphics and compare the results Open mletest-I.qrt and mletest-C.qrt respectively and compare their LR or LOD graphs.Mapping QTLs based on MCIM with QTLMapper Step1. Star

15、t by knowing menu system of QTLMapper 1.0 Step2. Preparing input files To use QTLMapper 1.0 for QTL mapping analysis, you need to get your marker linkage map, and data of markers and traits into two plain text files (a map file and a data file) in a format recognized by QTLMapper 1.0. These files ar

16、e collectively called Input Files. Step3. Working with “File” sub-menu File sub-menu performs operations related to input files. Open the map file (mletest.map) and the data file (mletst.txt) respectively. Step4. Working with “Run” sub-menu Run sub-menu implements all the operations related to mappi

17、ng QTLs. Some suggestions: Select “2. Map main-effect QTL” to run; Change the “Genomic range” of “setting mapping ranges” into “All”(i.e. whole genome);Experiments in Bioinformatics (生物信息学实验指导)Change the “No” for “For all testing points” of “How to save results” into “Yes”; Run filtration, Bayesian

18、test and calculation. Four main files are created at d: of your computer: mletest.qtl, mletest.flq, mletest.bye and mletest.ctq Step5. Working with “Output” sub-menu Output sub-menu is designed for processing the original result file from mapping QTLs with additive/epistatic effect so that the manua

19、l work for presentation with the original result file can be largely reduced. In addition, Output sub-menu can also be used for obtaining the results of hypothesis test using some special methods. You can make LR graph file (mletest.plt) at this step. Run Wgnuplot software (packed with QTLMapper) an

20、d open the mletest.plt file. Compare the LR graph with the other two graphs created by WinQTLCart. Step6. Understanding files created by QTLMapper1.0 Several kinds of result files will be created from the analysis for mapping QTLs with QTLMapper 1.0. To make inferences about the putative QTLs for th

21、e traits under study, the user needs to understand the contents of these result files. In general, every result file consists of two portions: description of conditions on which the result is obtained, and the result body. In these files, there is usually a word “End” that ends the files. Focus on t

22、he mletest.flq file (open it with Notepan).QUESTIONS FOR DISCUSSION 1. How many QTLs can been mapped in the rice experiment (see riceQTL file)? Where are their locations?Experiments in Bioinformatics (生物信息学实验指导)Experiment 2. Analysis of DNA Sequence( (2 学学时时) )THE PURPOSE 1. Molecular databases: you

23、 will learn how to use and understand molecular databases that store the wealth of information that is so useful to the molecular biologist, such as finding and retrieving sequence in public databases, how to read the coding of database entries, etc. 2. Similarity searching: perform your own similar

24、ity searches of provided “unknown” sequences on the nucleotide databases with BLAST, the most popular sequence alignment search tool. You have been given a unknown sequence to identify, but no clues as to what it is. The provider wants an unbiased opinion.LIST OF MATERIALS AND TOOLS SOD gene sequenc

25、es ; Two unknown sequences: AAAAGAAAAGGTTAGAAAGATGAGAGATGATAAAGGGTCCATTTGAGGTTAGGTAA TATGGTTTGGTATCCCTGTAGTTAAAAGTTTTTGTCTTATTTTAGAATACTGTGAT CTATTTCTTTAGTATTAATTTTTCCTTCTGTTTTCCTCATCTAGGGAACCCCAAGA GCATCCAATAGAAGCTGTGCAATTATGTAAAATTTTCAACTGTCTTCCTCAAAATA AAGAAGTATGGTAATCTTTACCTGTATACAGTGCAGAGCCTTCT

26、CAGAAGCACAGA ATATTTTTATATTTCCTTTATGTGAATTTTTAAGCTGCAAATCTGATGGCCTTAAT TTCCTTTTTGACACTGAAAGTTTTGTAAAAGAAATCATGTCCATACACTTTGTTGC AAGATGTGAATTATTGACACTGAACTTAATAACTGTGTACTGTTCGGAAGGGGTTC CTCAAATTTTTTGACTTTTTTTGTATGTGTGTTTTTTCTTTTTTTTTAAGTTCTTA TGAGGAGGGGAGGGTAAATAAACCACTGTGCGTCTTGGTGTAATTTGAAGATTGCC CC

27、ATCTAGACTAGCAATCTCTTCATTATTCTCTGCTATATATAAAACGGTGCTGTG AGGGAGGGGAAAAGCATTTTTCAATATATTGAACTTTTGTACTGAATTTTTTTGTA ATAAGCAATCAAGGTTATAATTTTTTTTAAAATAGAAATTTTGTAAGAAGGCAATA TTAACCTAATCACCATGTAAGCACTCTGGATGATGGATTCCACAAAACTTGGTTTT ATGGTTACTTCTTCTCTTAGATTCTTAATTCATGAGGAGGGTGGGGGAGGGAGGTG GAGGGAGGGAAGGGTTT

28、CTCTATTAAAATGCATTCGTTGTGTTTTTTAAGATAGTG TAACTTGCTTAAATTTCTTATGTGACATTAACAAATAAAAAAGCTCTTTTAATATTA GATAA GTCCGGCCTGGGCGACAGAGCAAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAA Databases: three main public databases(GenBank,EMBL and DDBJ). Tools: PubMed;Entrez and SRS;BLASTPROCEDURE Step1: Obtaining a

29、sequence of interest There are many ways to obtain a sequence of interest (SOD gene in this experiment) through or :Experiments in Bioinformatics (生物信息学实验指导)Search GenBank (or EMBL and DDBJ) for sequences of interest (US). Search PubMed, a public version of full Medline for topics of interest (US).

30、 Search a variety of sequence and structures databases using the SRS server at EMBL (Germany) or Entrez server at NCBI (US). Step2: Reading database entries (records) Step3: Find similar sequence in the databases with BLAST 1. Go to EXPASY mirror site at Peking University () 2. Enter the EXPASY (EMB

31、net) Basic BLAST Server WWW page If for any reason, you cannot access the EXPASY BLAST server, you can use any other BLAST server, such as the BLAST server of 3. Select the program: BLASTN This is the BLAST program that will compare a nucleotide query sequence against a nucleotide database. 4. Sele

32、ct the DNA database: All EMBL (without GHT and ESTs) This is the main EMBL nucleotide database. 5. Ignore the matrix option. It is not used by BLASTN. 6. Select sequence input format: Plain Text You will be submitting the nucleotide sequence in plain text. 7. Select the following: Gapped Alignment:

33、ON; BLAST filter: ON; Graphic Output: ON. These are all ON by default. 8. Paste the query sequence into the specified area. 9. Hit the button: Run BLAST 10. Wait as your query is processed by the server. 11. Examine the output. Step4: Reading the output of BLAST Sep5: Understanding BLAST 1. Copy the

34、 second unknown query sequence into the pasting window and run a same Basic BLAST search. Examine the result.QUESTIONS FOR DISCUSSION 1. There are three main different searching programs (BLAST, FASTA and BLITZ) available. Which program is best to use for a certain type of sequence? 2. Explain the r

35、esult of Step5.Experiments in Bioinformatics (生物信息学实验指导)Experiment 3. Analysis of Protein Sequence( (2 学学时时) )THE PURPOSE 1. Protein sequence databases: There are two major, non-specialised protein databases that you will frequently encounter: PIR and SWISS-PROT. Unlike the three major nucleotide da

36、tabases, the entries in PIR and SWISS-PROT are not mirrored (copied). Each one has its advantages and disadvantages, which you should consider before deciding which database to search. you will learn how to use and understand the entries in those databases. 2. Protein databases searching: Protein da

37、tabase searching is the most important method to master. It is between two and five times more sensitive than DNA database searching. Perform similarity searching in protein database which you specify with the same programs (BALST or FASTA).LIST OF MATERIALS AND TOOLS Protein id CAA32643.1 and CAA00

38、826.1 A human amino acid sequence: MSTAVLENPGLGRKLSDFGQETSYIEDNCNQNGAISLIFSLKEEVGALAK VLRLFEENDVNLTHIESRPSRLKKDEYEFFTHLDKRSLPALTNIIKILRHDI GATVHELSRDKKKDTVPWFPRTIQELDRFANQILSYGAELDADHPGFKD PVYRARRKQFADIAYNYRHGQPIPRVEYMEEEKKTWGTVFKTLKSLYKT HACYEYNHIFPLLEKYCGFHEDNIPQLEDVSQFLQTCTGFRLRPVAGLLS SRDFLGGLAF

39、RVFHCTQYIRHGSKPMYTPEPDICHELLGHVPLFSDRSFA QFSQEIGLASLGAPDEYIEKLATIYWFTVEFGLCKQGDSIKAYGAGLLSSF GELQYCLSEKPKLLPLELEKTAIQNYTVTEFQPLYYVAESFNDAKEKVRN FAATIPRPFSVRYDPYTQRIEVLDNTQQLKILADSINSEIGILCSALQKIK Databases: PIR and SWISS-PROT; PDB and PAHdb Tools: BLAST or FASTAPROCEDURE Step1. Obtaining the sequenc

40、es of interest and examine the results (not necessary in this experiment) Search PubMed, a public version of full Medline for topics of interest (US). Search a variety of sequence and structures databases using the SRS server at EMBL (Germany) or Entrez server at NCBI (US). Search PIR (or WISS-PROT)

41、 for sequences of interest (US). Step2. A exercise of searching In the exercise given below, you will integrate the knowledge you have gained from last experiment and classroom. You should also realize how easy it is to use other databases and related sources of information, particularly now that yo

42、u have an understanding of the molecular databases. 1. Go to a sequence alignment program of your choice. You might choose to use:Experiments in Bioinformatics (生物信息学实验指导)The EXPASY BLAST server. Or the GeneStream FASTA server. 2. Copy the human amino acid sequence (given in the one letter code). 3.

43、 Paste the sequence into the query sequence window and adjust the options as necessary. You wont need to specify advanced options, but you should choose a program and database. For simplicity, please use the main SWISS- PROT database. You may wish to try other databases, but you should return to SWI

44、SS-PROT when continuing with this exercise. 4. Run the search and identify the protein. Select the following: Gapped Alignment: ON; BLAST filter: ON; Graphic Output: ON. Matrix: blosum62 5. Use the link provided to see the SWISS-PROT report. If the link fails for any reason, you can do a text search

45、 of SWISS-PROT. Go to SWISS-PROT and search by the identifier you identified after the BLAST or FASTA search. Step3. Answer the following questions and correct them Now, try to answer all of the questions below. You may need to look at pages that are linked from the SWISS-PROT report, but you will n

46、ot need to search further than the first page of any site. Answering all of the questions may take some time, but you will get a feel for what is available, and how to get it. You may even find yourself becoming fascinated by the report, and exploring on your own! Write down the answers, and see if

47、you got them right by comparing your answers to the correct answers on the next page. 1. What is the SWISS-PROT name of the entry? 2. What is the SWISS-PROT primary accession number? 3. What is the most common name of the protein? 4. What is the gene called? 5. Which year was the crystal structure o

48、f the catalytic domain determined? Name the first name author. 6. Does the enzyme require a co-factor to function? If so, what? 7. Name the most common disease that arises as a result of deficiency of this enzyme. 8. Which cytogenetic locus does the gene reside at? (e.g. 13p10.1) 9. What is the PAHd

49、b? 10. How many amino acid residues are there in the protein? 11. What is the molecular weight of the protein? 12. More tasks (if you can): Look briefly at entries in GeneCards, MIM (Mendelian Inheritance in Man), obtain the nucleic acid sequence and locate a FASTA report for the protein sequence. Vi

展开阅读全文
相关资源
相关搜索

当前位置:首页 > 应用文书 > 教育教学

本站为文档C TO C交易模式,本站只提供存储空间、用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。本站仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知得利文库网,我们立即给予删除!客服QQ:136780468 微信:18945177775 电话:18904686070

工信部备案号:黑ICP备15003705号-8 |  经营许可证:黑B2-20190332号 |   黑公网安备:91230400333293403D

© 2020-2023 www.deliwenku.com 得利文库. All Rights Reserved 黑龙江转换宝科技有限公司 

黑龙江省互联网违法和不良信息举报
举报电话:0468-3380021 邮箱:hgswwxb@163.com