Human Population Genetics: Basic Principles and Analytical Methods

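Graduate course of the Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences: Human Population Genetics. Human Population Genetics: Basic Principles and Analytical Methods. CAS-MPG Partner Institute for Computational Biology. Shuhua Xu, Li Jin. Lecture 8: Analysis of Population Genetic Structure (II). Topics: population differentiation and genetic diversity; STRUCTURE analysis; file format; parameter settings; interpretation of results; software demonstration; STRUCTURE 2.2.3. Analysis of population genetic structure: gene tree based; AMOVA (hierarchical F statistics); factor analysis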


Presentation Transcript


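  1. Graduate course of the Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences: Human Population Genetics. Human Population Genetics: Basic Principles and Analytical Methods. CAS-MPG Partner Institute for Computational Biology. Shuhua Xu, Li Jin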

  2. Lecture 8: Analysis of Population Genetic Structure (II)

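  3. Lecture 8 • Population differentiation and genetic diversity • STRUCTURE analysis • File format • Parameter settings • Interpretation of results • Software demonstration • STRUCTURE 2.2.3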

  4. Analysis of population genetic structure • Gene tree based • AMOVA (hierarchical F statistics) • Factor analysis • Principal component analysis • STRUCTURE analysis

  7. Geographical distribution HGDP samples (52 populations)

  8. Previous genome-wide data in HGDP panel • Science 2002: 52 populations, 1,056 individuals; 377 autosomal STRs • PLoS Genet 2005: 52 populations, 1,048 individuals; 783 STRs, 210 indels • Nature Genetics 2006: 52 populations, 927 individuals; 3,024 SNPs in 36 genomic regions

  9. NIH & University of Michigan; Stanford University

  10. Genotype, haplotype and copy-number variation in worldwide human populations • Study design: • Genome-wide patterns of variation; • Fine-scale population structure. • Data structure: • 29 HGDP populations, 485 individuals. • 4 HapMap populations, 112 individuals. • 525,910 SNPs, 396 CNVs (Illumina HumanHap550K). • New findings: • Increasing linkage disequilibrium is observed with increasing geographic distance from Africa (a serial founder effect). • The global distribution of CNVs largely accords with population structure analyses for SNP data sets of similar size. • Conclusions: • Support the utility of CNVs in human population-genetic research.

  11. Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation • Study design: • Human genetic diversity; • Fine-scale population structure. • Data structure: • 51 populations; 938 individuals. • 650,000 SNPs (Illumina HumanHap650K). • New findings: • The relationship between haplotype heterozygosity and geography was consistent with the hypothesis of a serial founder effect with a single origin in sub-Saharan Africa. • Observed a pattern of ancestral allele frequency distributions that reflects variation in population dynamics among geographic regions. • Conclusions: • This data set allows the most comprehensive characterization to date of human genetic variation. Individual ancestry and population substructure are detectable with very high resolution.

  12. NJ tree based on SNP genotypes

  13. Population structure inferred by STRUCTURE

  14. Maximum likelihood tree of 51 populations (150,000 SNPs). Figure labels: Oceania, America, East Asia, South/Central Asia, Europe, Middle East, North Africa

  15. MDS plots

  16. MDS plots of individuals (panels: SNP, haplotype, CNV)

  17. MDS: chromosome 21, 220 SNPs, Nei’s D_A distance
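
The distance named on slide 17 can be sketched as follows. This is a minimal illustration of Nei's D_A, assuming the standard definition D_A = 1 - (1/L) Σ_l Σ_j sqrt(x_lj · y_lj), where x_lj and y_lj are the frequencies of allele j at locus l in the two populations. The function name and the example frequencies are hypothetical and are not taken from the lecture.

```python
import math


def nei_da(freqs_x, freqs_y):
    """Nei's D_A distance between two populations.

    freqs_x and freqs_y hold one entry per locus; each entry lists the
    allele frequencies at that locus, in the same allele order for both
    populations.
    """
    if len(freqs_x) != len(freqs_y) or not freqs_x:
        raise ValueError("both populations need the same non-zero number of loci")
    shared = 0.0
    for locus_x, locus_y in zip(freqs_x, freqs_y):
        # Sum of sqrt(x * y) over alleles: 1 for identical frequencies,
        # 0 when the two populations share no allele at this locus.
        shared += sum(math.sqrt(x * y) for x, y in zip(locus_x, locus_y))
    return 1.0 - shared / len(freqs_x)


# Hypothetical allele frequencies at two biallelic SNPs.
pop_a = [[0.5, 0.5], [0.9, 0.1]]
pop_b = [[0.5, 0.5], [0.1, 0.9]]

print(nei_da(pop_a, pop_a))
print(nei_da(pop_a, pop_b))
```

A matrix of such pairwise distances is the kind of input an MDS plot is computed from.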

  18. PCA plots

  19. PCA of populations

  20. PCA of individuals

  21. STRs cannot, SNPs can (Europe, Middle East)

  22. Han and Northern Han

  23. 56 ethnic groups in China

  24. Genetic structure of language families

  25. Two types of genetic structure

  26. Legend: all other Han Chinese; sky blue: CN-GA, CN-PH; olive green: TW-HA, TW-HB; brown: SG-CH

  27. Inference on population structure using multi-locus genotype data: STRUCTURE v2.2.3. Pritchard, Stephens, and Donnelly (2000); Falush, Stephens, and Pritchard (2003)

  28. Main objective • Assign individuals to populations on the basis of their genotypes, while simultaneously estimating population allele frequencies

  29. Other objectives • Begin with a set of predefined populations and classify individuals of unknown origin • Identify the extent of admixture of individuals • Infer the origin of particular loci in the sampled individuals

  30. STRUCTURE is a model-based clustering method (we must make assumptions about many parameters and distributions)

  31. Four basic models • Model without admixture: each individual is assumed to originate in one (and only one) of K populations • Model with admixture: each individual is assumed to have inherited some proportion of its ancestry from each of K populations

  32. Four basic models • Linkage model: “chunks” of chromosomes are derived as intact units from one or another of the K populations, and all allele copies on the same “chunk” derive from the same population. The model accounts for the resulting correlations in ancestry between linked loci

  33. Four basic models • F model: the populations are assumed to have all diverged from a common ancestral population at the same time, but the model allows them to have experienced different amounts of drift since the divergence event
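
The difference between the first two models (slide 31) can be made concrete with a small likelihood sketch. It assumes known allele frequencies and treats allele copies as independent draws; the frequency table `P`, the function names and the example genotype are hypothetical, and this is not the code used by STRUCTURE itself, which estimates these quantities by MCMC.

```python
import math

# Hypothetical allele frequencies: P[k][l][a] is the frequency of allele a
# at locus l in population k (K = 2 populations, L = 2 biallelic loci).
P = [
    [[0.9, 0.1], [0.8, 0.2]],
    [[0.2, 0.8], [0.3, 0.7]],
]


def loglik_no_admixture(genotype, k, freqs=P):
    """Log-likelihood of the ordered allele copies when every copy
    comes from the single population k."""
    return sum(
        math.log(freqs[k][l][a])
        for l, copies in enumerate(genotype)
        for a in copies
    )


def loglik_admixture(genotype, q, freqs=P):
    """Log-likelihood when each allele copy comes from population k with
    probability q[k], independently of the other copies."""
    total = 0.0
    for l, copies in enumerate(genotype):
        for a in copies:
            total += math.log(sum(q[k] * freqs[k][l][a] for k in range(len(q))))
    return total


# One diploid individual: two allele copies at each of two loci.
individual = [(0, 1), (0, 0)]

print(loglik_no_admixture(individual, 0))
print(loglik_admixture(individual, [0.5, 0.5]))
```

Setting `q` to put all its weight on one population, for example `[1.0, 0.0]`, reduces the admixture likelihood to the no-admixture one, which is one way to see the first model as a special case of the second.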

  34. Assumptions • “Our main modeling assumptions are Hardy-Weinberg equilibrium within populations and complete linkage equilibrium between loci within populations” • “Loosely speaking, the idea here is that the model accounts for the presence of HWD or LD by introducing population structure and attempts to find population groupings that (as far as possible) are not in disequilibrium”
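
A small numerical sketch of why unrecognized population structure shows up as Hardy-Weinberg disequilibrium (the Wahlund effect), which is the signal the second quotation refers to. The allele frequencies and the equal-sized pooling are hypothetical values chosen for illustration.

```python
def hwe_genotype_freqs(p):
    """Expected genotype frequencies (AA, Aa, aa) at a biallelic locus
    under Hardy-Weinberg equilibrium, given the frequency p of allele A."""
    q = 1.0 - p
    return p * p, 2.0 * p * q, q * q


# Two hypothetical populations, each in HWE, with different frequencies of A.
p1, p2 = 0.1, 0.9

# Heterozygosity actually present in an equal-sized pooled sample.
het_observed = 0.5 * (hwe_genotype_freqs(p1)[1] + hwe_genotype_freqs(p2)[1])

# Heterozygosity expected if the pooled sample were one HWE population.
p_pooled = 0.5 * (p1 + p2)
het_expected = hwe_genotype_freqs(p_pooled)[1]

print(het_observed, het_expected)
```

The pooled sample has fewer heterozygotes than a single population in equilibrium would have. Splitting it back into the two groups removes the deficit, which is the sense in which the model looks for groupings that are not in disequilibrium.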

  35. Data • Consider a sample of N individuals, each genotyped at L loci • Assume that the individuals represent a mixture of K unobserved populations (K unknown) • If diploid, we have an N×2L data matrix X • If n-ploid, X is N× … , where J_l is the number of alleles at the l-th locus
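
The diploid layout on slide 35 can be sketched as below: one row per individual and two columns per locus. The function name, the integer allele coding and the `ploidy` argument are assumptions made for this illustration. In particular, the generalization to other ploidies is my own and does not restore the formula missing from the slide.

```python
def genotype_matrix(genotypes, ploidy=2):
    """Flatten per-locus genotypes into a data matrix X.

    genotypes[i][l] is a tuple holding `ploidy` allele codes for
    individual i at locus l. The result has N rows and ploidy * L columns.
    """
    matrix = []
    for individual in genotypes:
        row = []
        for alleles in individual:
            if len(alleles) != ploidy:
                raise ValueError("every locus needs exactly `ploidy` allele copies")
            row.extend(alleles)
        matrix.append(row)
    return matrix


# Hypothetical data: N = 3 diploid individuals genotyped at L = 3 loci.
sample = [
    [(1, 2), (3, 3), (1, 1)],
    [(2, 2), (3, 4), (1, 2)],
    [(1, 1), (4, 4), (2, 2)],
]

X = genotype_matrix(sample)
print(len(X), len(X[0]))
```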

  36. Input file format
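
The slide itself is an image, so the layout below is a sketch of one common STRUCTURE input convention rather than a transcription: two consecutive rows per diploid individual, an individual label and a population identifier in the first two columns, and a numeric code for missing data. The helper name, the sample labels and the choice of -9 as the missing-data code are assumptions; the real file must match the settings in the parameter files.

```python
def structure_rows(label, pop, genotype, missing=-9):
    """Return the two text rows describing one diploid individual.

    genotype is a list of (allele1, allele2) tuples, one per locus;
    None marks a missing allele and is written as the `missing` code.
    """
    rows = []
    for copy in range(2):
        alleles = [missing if locus[copy] is None else locus[copy]
                   for locus in genotype]
        rows.append(" ".join(str(value) for value in [label, pop] + alleles))
    return rows


# Hypothetical individuals genotyped at three loci.
lines = []
lines += structure_rows("ind1", 1, [(1, 2), (None, None), (3, 3)])
lines += structure_rows("ind2", 2, [(2, 2), (4, 1), (3, 1)])

print("\n".join(lines))
```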

  37. Parameter setting • Main parameters (mainparams.txt) • Extra parameters (extraparams.txt)
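
As a rough sketch of what the main parameter file contains, the snippet below renders a handful of settings in the `#define NAME value` form that STRUCTURE parameter files use. The selection of parameters and all values are illustrative assumptions, not the lecture's settings, and the names should be checked against the documentation of the STRUCTURE version in use.

```python
def render_params(params):
    """Render settings as '#define NAME value' lines, one per parameter."""
    return "\n".join(f"#define {name} {value}" for name, value in params.items())


# Illustrative values only; file names and counts are hypothetical.
main_params = {
    "MAXPOPS": 3,          # number of populations K assumed for this run
    "BURNIN": 10000,       # burn-in iterations before samples are collected
    "NUMREPS": 20000,      # MCMC iterations after burn-in
    "INFILE": "genotypes.txt",
    "OUTFILE": "results",
    "NUMINDS": 100,        # individuals in the input file
    "NUMLOCI": 50,         # loci in the input file
    "PLOIDY": 2,
    "MISSING": -9,         # code used for missing genotypes
    "ONEROWPERIND": 0,     # 0: each individual occupies consecutive rows
    "LABEL": 1,            # input file has an individual-label column
    "POPDATA": 1,          # input file has a population-identifier column
}

text = render_params(main_params)
print(text)
```

The values for `NUMINDS`, `NUMLOCI`, `MISSING`, `LABEL` and `POPDATA` have to agree with the actual input file, so it is usually easier to derive them from the data than to type them by hand.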

  38. Software demonstration (STRUCTURE)

  39. Summary plot of estimates of individual membership fraction

  40. Commonly used software • STRUCTURE • http://pritch.bsd.uchicago.edu/software/structure2_2.html • EIGENSOFT • http://genepath.med.harvard.edu/~reich/Software.htm • SPSS

  41. Exercise • Run a STRUCTURE analysis using HapMap data; • http://www.hapmap.org
