Although many advances have been made by studying individual genes, the global picture of tissue-specific gene expression in an organism remains dim. The relatively simple body plan, yet representative of many specific tissues, and complete genome sequence of the nematode C. elegans allows for a systems biology approach to the question of which genes specify a tissue. A global profile of the tissue-specific gene expression of this organism would expand our knowledge of tissue development and maintenance, regulation of gene expression, higher order chromosome structure, and the housekeeping genes that characterize a cell. In order to identify the genes expressed in the intestine of the worm we have taken the mRNA-tagging approach described by Roy et al (2002) to identify genes expressed in muscle cells. Animals expressing FLAG::PAB-1 from the intestine-specific promoter
ges-1 were used to immunoprecipitate FLAG::PAB-1/mRNA complexes from the intestine and the number of enriched genes relative to whole worm lysate was determined by microarrays. The average ranks from eight repeats of this experiment identified 1938 intestine-enriched genes. First, we compared the genes expressed in the intestine to those expressed in muscle (Roy et al. 2002), and thereby identified 807 genes expressed in both tissues and 645 genes enriched in the intestine versus muscle. Second, we showed that the 1938 intestine-enriched genes were also positionally clustered on the chromosomes, suggesting that the order of genes in the genome is influenced by the effect of chromatin domains on gene expression. Furthermore, the tissue-specific lists showed less chromosomal clustering than the list of genes expressed in both the intestine and muscle. This observation suggests that chromatin domains may influence housekeeping genes more than tissue-specific genes. Third, in order to gain further insight into the regulation of expression of intestine-enriched genes, we searched for regulatory motifs in the set of intestine-enriched genes. We used a modified Gibbs sampling method called CompareProspector (Liu, 2004) to identify motifs that were conserved with C. briggsae and over-represented in the 1kb upstream region of the 645 intestine-specific genes. This analysis found that the promoter regions of the intestine genes were enriched for the consensus sequence for GATA transcription factors at a rate two-fold over random. In order to test the functionality of the GATA motif, we are using transcriptional GFP fusions of several intestinal markers with wild type and mutated GATA sites. Liu, Y., Liu, X. S., Wei, L., Altman, R. B., & Batzoglou, S. (2004). Eukaryotic regulatory element conservation analysis and identification using comparative genomics. Genome Research, 14(3), 451-458. Roy, P. J., Stuart, J. M., Lund, J., & Kim, S. K. (2002). Chromosomal clustering of muscle-expressed genes in Caenorhabditis elegans. Nature, 418(6901), 975-979.