Transcriptome and genome sequencing uncovers functional variation in humans.
Lappalainen T., Sammeth M., Friedländer MR., 't Hoen PAC., Monlong J., Rivas MA., Gonzàlez-Porta M., Kurbatova N., Griebel T., Ferreira PG., Barann M., Wieland T., Greger L., van Iterson M., Almlöf J., Ribeca P., Pulyakhina I., Esser D., Giger T., Tikhonov A., Sultan M., Bertier G., MacArthur DG., Lek M., Lizano E., Buermans HPJ., Padioleau I., Schwarzmayr T., Karlberg O., Ongen H., Kilpinen H., Beltran S., Gut M., Kahlem K., Amstislavskiy V., Stegle O., Pirinen M., Montgomery SB., Donnelly P., McCarthy MI., Flicek P., Strom TM., Geuvadis Consortium None., Lehrach H., Schreiber S., Sudbrak R., Carracedo A., Antonarakis SE., Häsler R., Syvänen A-C., van Ommen G-J., Brazma A., Meitinger T., Rosenstiel P., Guigó R., Gut IG., Estivill X., Dermitzakis ET.
Genome sequencing projects are discovering millions of genetic variants in humans, and interpretation of their functional effects is essential for understanding the genetic basis of variation in human traits. Here we report sequencing and deep analysis of messenger RNA and microRNA from lymphoblastoid cell lines of 462 individuals from the 1000 Genomes Project--the first uniformly processed high-throughput RNA-sequencing data from multiple human populations with high-quality genome sequences. We discover extremely widespread genetic variation affecting the regulation of most genes, with transcript structure and expression level variation being equally common but genetically largely independent. Our characterization of causal regulatory variation sheds light on the cellular mechanisms of regulatory and loss-of-function variation, and allows us to infer putative causal variants for dozens of disease-associated loci. Altogether, this study provides a deep understanding of the cellular mechanisms of transcriptome variation and of the landscape of functional variants in the human genome.