Transcription and RNA Processing in Living Cells
Cells express an astonishing variety of mRNA transcripts from a limited pool of genes. Across tissues and even among individuals, mRNAs produced from the same gene differ at their 5’ and 3’ ends as well as throughout the transcript body, enabling the expression of numerous protein products per gene. Transcript diversity is due to regulation of transcription and splicing, which we investigate in vivo. We have established experimental systems in budding yeast, zebrafish embryos, and mammalian tissue culture cells to explore transcription and splicing regulation in a variety of biological contexts and with a diversity of tools, from imaging to genome-wide approaches. Our observations have provided novel insights into transcription and splicing mechanisms as well as principles of cellular organization that facilitate efficient gene expression.
Coordination of Transcription and Splicing
All protein-coding genes are transcribed by RNA polymerase II (Pol II); the resulting pre-mRNA transcripts are spliced by a distinct macromolecular machine, the spliceosome, to produce mRNA. These two reactions, transcription and splicing, occur independently of one another in vitro. Using “splicing factor ChIP”, a method we developed, we showed that the spliceosome assembles while the nascent transcript is still attached to chromatin by Pol II. Thus, transcription and chromatin have the potential to influence splicing outcome in vivo. Current projects investigate the roles of regulatory factors and chromatin modifications in determining splicing efficiency and which of the many possible alternative transcripts cells express.
A long-standing question in the field has been whether transcription and splicing are directly coupled. Using a genome-wide approach in budding yeast, we recently discovered that Pol II pauses within terminal exons, which allows highly efficient co-transcriptional splicing. Until now, Pol II pausing was thought to occur regularly only during transcription initiation and termination. The phenomenon of terminal exon pausing indicates that specific mechanisms have evolved to directly couple transcription and splicing. We plan to determine the molecular mechanism of terminal exon pausing and how co-transcriptional splicing contributes to gene expression.
Cajal bodies and the macromolecular assembly of RNPs
Cajal bodies (CBs) were identified more than 100 years ago by Ramón y Cajal in vertebrate neurons. The function of these 0.5-1 µm spherical structures, which like other cellular subcompartments (PML bodies, P bodies, P granules, stress granules, nucleoli) lack membranes, has long been mysterious. Do these bodies have functions per se? Or are they just sticky places where molecules collect? Using live-cell imaging, we have shown that assembly of the macromolecular splicing complexes – the spliceosomal snRNPs – occurs in CBs. Mathematical modeling predicted that snRNP assembly is ~10-fold more efficient when CBs are present; this suggested that CBs increase the efficiency of gene expression by facilitating splicing.
We established the zebrafish embryo as a model to test CB function. Combining high-resolution imaging in live embryos, targeted knockdown, biochemistry, and molecular biology techniques, we identified an essential function of CBs. Loss of CBs resulted in splicing defects and embryonic lethality, due to an inability to assemble sufficient snRNPs. Thus, CBs promote efficient macromolecular assembly of snRNPs. This work reveals a novel element in cellular logistics, in which CBs and likely other such compartments facilitate macromolecular assembly by concentrating interacting components without the diffusional barrier of membranes. We wonder whether the CB provides a “catalytic surface” for macromolecular assembly, perhaps by aligning interaction partners in favorable orientations. We are taking in vivo and in vitro approaches to understand the structure and molecular function of CBs in snRNP assembly.
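The idea that concentrating components can speed up assembly can be illustrated with a toy calculation. The sketch below is not the model referred to above; it is a minimal illustration under stated assumptions: assembly is a single bimolecular step following mass-action kinetics, both partners partition identically into the body, the rate constant is the same inside and outside, and each compartment is well mixed. The function name and both parameters are hypothetical and chosen only for this example.

```python
def assembly_rate_enhancement(fraction_in_body, body_volume_fraction):
    """Total bimolecular assembly rate with a body, relative to uniform mixing.

    Toy assumptions (not taken from the published work):
      - rate per unit volume = k * [A] * [B] (mass action)
      - a fraction p of BOTH partners sits in the body
      - the body occupies a fraction f of the nuclear volume
      - k is identical inside and outside the body

    Inside the body each concentration is scaled by p / f, outside by
    (1 - p) / (1 - f). Integrating the rate over both volumes and dividing
    by the uniform case gives p^2 / f + (1 - p)^2 / (1 - f).
    """
    p = fraction_in_body
    f = body_volume_fraction
    if not 0.0 < f < 1.0:
        raise ValueError("body_volume_fraction must lie strictly between 0 and 1")
    if not 0.0 <= p <= 1.0:
        raise ValueError("fraction_in_body must lie between 0 and 1")
    inside = p * p / f                      # contribution of the body
    outside = (1.0 - p) ** 2 / (1.0 - f)    # contribution of the rest of the volume
    return inside + outside


if __name__ == "__main__":
    f = 0.01  # hypothetical: bodies occupy 1% of the nuclear volume
    for p in (0.01, 0.1, 0.3, 0.5):
        ratio = assembly_rate_enhancement(p, f)
        print(f"fraction in body = {p:.2f} -> relative assembly rate = {ratio:.2f}")
```

When the fraction of components in the body equals the body's volume fraction, the components are uniformly distributed and the ratio is 1. Any enrichment beyond that raises the ratio, because the gain inside the small volume outweighs the loss outside it. Real snRNP assembly involves multiple steps, transport in and out of the body, and possibly altered rate constants, none of which this sketch captures.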
mRNP formation, composition and function
Genomes encode many hundreds of RNA binding proteins that have roles in transcription, splicing, subcellular localization, stability and translation. Yet we lack a comprehensive understanding of how they work. Each mRNA is bound by numerous RNA binding proteins during its lifetime. How do nascent and mature mRNPs assemble? What is their composition? What are the specific functions of mRNP components in gene expression? These questions currently represent a black box in our knowledge of gene expression.
We study a family of essential RNA binding proteins, the SR proteins, as representatives of this class of regulators. We established physiological expression of tagged versions of each SR protein from bacterial artificial chromosomes (BACs) stably integrated into multipotent murine cell lines. The uniform tag on each protein facilitates biochemical purification of SR protein-specific mRNPs, from which protein and RNA components are analyzed. We identified the mRNA cargoes of SR proteins in cycling and neural cells and found that individual SR proteins associate with a discrete set of mRNAs that changes upon neural differentiation. Many target mRNAs required the cognate SR protein for their expression. Identification of mRNP components in cycling and neural cells by mass spectrometry is in progress. Targeted depletion of individual SR proteins leads to discrete, largely non-overlapping changes in alternative splicing. Our vision is that the SR proteins provide an opportunity to systematically determine the role of RNA-binding proteins in each step of gene expression, because we can compare and contrast family members that are structurally highly related. We are currently generating large genome-wide datasets to provide insight into the function of SR proteins at all phases of gene expression.
© 2008-2013 MPI-CBG