supamas: 2009

วันศุกร์ที่ 25 ธันวาคม พ.ศ. 2552

Assignment 3

Who are the ancestors of the dinosaurs?
Science 1994 Nov 18;266(5188):1229-1232

DNA was extracted from 80-million-year-old bone fragments found in strata of the Upper Cretaceous Blackhawk Formation in the roof of an underground coal mine in eastern Utah.

This DNA was used as the template in a polymerase chain reaction that amplified and sequenced a portion of the gene encoding mitochondrial cytochrome b.

These sequences differ from all other cytochrome b sequences investigated, including those in the GenBank and European Molecular Biology Laboratory databases.

DNA isolated from these bone fragments and the resulting gene sequences demonstrate that small fragments of DNA may survive in bone for millions of years.

The authors conclude that the DNA sequence,

cccttctattattcattctcattctattcgttattcttgtactccacacatccaaacaacaaag
cataatattccacccattgagtccattcctatcctgattcttagtccccgaaccttttacactcacatg
,appears to be from a dinosaur that lived 80 million years ago.

Show us step by step of how to do phylogenetic analysis with cytochrome b sequences. Then, what is your conclusion about the structure of the tree and the position of the dinosaur sequence that might come close to these following species?

Use these following species
o Human
o Dog
o Rabbit
o rhinoceros
o dugong
o mouse
o whale
o bovine
o sicklebill
o chicken
o magpie
o frog

Hint: You need to create a phylogentic tree. But first you need to get the cytochrome b nucleotide sequences for the different species from NCBI.
ขั้นตอนการทำ
1. เข้าไปยังเว็บไชต์ style="font-family:times new roman;">http://www.ncbi.nlm.nih.gov/ ดังรูป

2. เลือก nucleotide และใส่ keyword ที่ต้องการลงไป

span >3.สำหรับการค้นหาและการเลือก sequences
3.1ใส่คำค้นชื่อสิ่งมีชีวิตและsequence ที่ต้องการ (เอาเฉพาะ Ref. sequ. ที่เป็น complete genome จาก mitochondrion) เช่นเราต้องการค้นหา sequence ของ Human จะได้ว่า "Homo sapiens and cytochrome b and reference sequence"เป็นต้น
3.2สำหรับสิ่งมีชีวิตที่ไม่สามารถหา mitochondrion complete genome โดยเฉพาะ cytochrome b ก็จะเลือกใช้ข้อมูลที่สมบรูณ์ที่สุด โดยตัดคำว่า reference sequence ออกจากการหา
3.3 เมื่อหา sequence ของสิ่งมีชีวิตที่ต้องการได้แล้วให้ทำการ download sequence ในรูป fasta ทำการ save ไว้ในคอมพิวเตอร์ ซึ่งแสดงให้เห็นดังรูป

4.เข้าไปที่ http://www.google.co.th/

5. พิมพ์ bioedit free download

6. เลือก bioedit free download

7. ทำการ download program bioedit และ save program ไว้ในเครื่องคอมพิวเตอร์

8. ทำการเปิดไฟล์ที่ทำการ save ไว้และทำการติดตั้ง program bioedit(set up)

9. ทำการเปิด program bioedit แสดงหน้าจอแรกดังรูป

10. จากนั้นเลือก import ไฟล์ของสิ่งมีชีวิตที่ต้องการ(ที่ได้จากข้อ 3.3) ดังรูป

11. จะได้ sequence ดังรูป

12.จากนั้นให้เลือกไปที่คำสั่งในเมนู Accessory Application--> ClustalW Multiple alignment
---> Run ClustalW

13. save text ---->Accessory Application แล้วเข้าไปที่
DNAmk DNA Maximum likelihood program with molecular clock

14.จะได้ Data ดังภาพ หลังจากนั้น save as text file

15.เปิดโปรแกรม Tree view แล้วเปิด file ที่save ไว้ก่อนหน้านั้น

16.โปรแกรมจะเริ่มทำงานและรายงานผลออกมาในรูปแบบ graphic ของ phylogenetic tree

วันพุธที่ 25 พฤศจิกายน พ.ศ. 2552

Assignment 2

find out
1.What is the name of haploview format to use in this analysis?
2.Please show us the marker and individual quality control of the genotype data use in the analysis?
3.Please show us the LD map then explain what do you get from the LD map?
4.How many haplotype blocks in this region of Chromosome X, then explain how to interpret them?
5.Could you find out the tagging SNP in each haplotype block, then explain what the tagging SNPs?

สิ่งที่จำเป็นต้องมี
I. program Haploview 4.1 ซึ่งสามารถ download ได้จาก http://www.broadinstitute.org/haploview/haploview-downloads

II. program Java โดย download ได้จาก http://www.java.com/en/

ขั้นตอนการใช้งาน program Haploview

1. นำไฟล์ที่มีข้อมูลที่ต้องการมาใส่ลงในโปรแกรม microsoft word และทำการบันทึกให้เป็นแบบ plain text

2. เปิดโปรแกรม haploview ดังรูป

3. เลือก Hapmap Format และทำการ browse ไฟล์ที่ต้องการจากข้อ 1 และกด "OK"

4. โปรแกรม Haploview จะทำการ load ไฟล์ที่ต้องการขึ้นมาดังรูป

5. ขั้นตอนต่อไปเลือกที่ Check Markers แล้วทำการเปลี่ยนค่า HW p-value cutoff จาก 0.0010 เป็น 0.05 (ที่ค่าความเชื่อมั่นที่ 95 % ) แล้วกด Rescore Markers ค่า p-value ที่น้อยกว่า 0.05 จะขึ้นแถบสีแดง ส่วนค่า p-value ที่เราต้องการนั้นจะขึ้นสีดำ

6. จากนั้นเปิดหน้า LD Plot จะเห็นว่ามี 3 blocks ด้วยกันที่จะสามารถเลือกนำมาศึกษาได้ซึ่งบริเวณของ block ทั้ง 3 จะอยู่ภายใต้พื้นที่ที่มีสีแดง

7. จากนั้นทำการเปิดหน้า Haplotypes จะเห็นว่ามีตัวเลข 3 blocks ด้วยกันที่อยู่บนโครโมโซม X นี้

Block ที่ 1 มี 8,9

Block ที่ 2 มี 13,14,15,16,17

Block ที่ 3 มี 24,25,26,27,28,29
ซึ่งตัวเลขในหน้านี้จะตรงกับตัวเลขที่อยู่ในหน้า LD Plot

8. เลือก Display แล้วเลือก Show tags in blocks สรุปจะได้ tagging SNPs ซึ่งสามารถนำไปใช้ศึกษาได้คือ

Block ที่ 1 8,9

Block ที่ 2 13,15

Block ที่ 3 24,27

วันอังคารที่ 24 พฤศจิกายน พ.ศ. 2552

Assignment 1

To construct the workflow from Taverna program.
1. To open Taverna program. You can see the three part of Taverna program on the first page.

2. Go to file and open the new workflow.

3. Put “ fasta ” in the box search. Choose “ Get Nucleotide FASTA ” and click “ add to the model ”.

4. You can see “ Get Nucleotide FASTA ” in the diagram pane. And you can create the name of new input.

5. The data of input show in diagram pane.

6. Put “ regular ” in the box search. Choose “ Split string into string list by regular expression” and click “ add to the model”.

7. The data of “Split string into string list by regular expression” show in diagram pane.

8. Make connection between workflow inputs and Split string into string list by regular expression. Choose “ Workflow Input Accession_Number ”, select Split string into string list by regular expression and click in the following picture below.

9. The connection show in diagram plane.

10. Make connection between Split string into string list by regular expression and Get Nucleotide FASTA. Choose “ Connect output split ”, select “ Get Nucleotide FASTA ” and click choose an input.

11. The connection show in diagram plane.

12.In the “ repex ”, you can set default value. Input "GU" in the box.

13. You can create new output. And add the name of output data.

14. The data of workflow outputs show in diagram pane.

15. Make connection between Workflow output and Get nucleotide FASTA. Choose “ Output in Get nucleotide FASTA ”, select workflow output and click at fasta sequences.