The Nuclear Factor I family of proteins (NFI) bind to specific sites on duplex DNA and are important both for viral DNA replication and the transcription of cellular genes. In vertebrates, four NFI genes (the NFI-A, NFI-B, NFI-C and NFI-X paralogous groups) are highly conserved from avians to humans. However, both degenerate PCR and hybridization of the conserved DNA-binding domain to genomic DNA and Yac blots detects only a single NFI gene in C. elegans (designated
nfi-1 on cosmid ZK1290). By RT-PCR we cloned cDNAs from the
nfi-1 gene and showed that: 1) the gene is alternatively spliced in C. elegans, 2) Genefinder exon predictions are ~80% correct in this gene, and 3)
nfi-1 shares a homologous 3' splice acceptor site with the mouse NFI-A and NFI-C genes. Surprisingly however, while all 4 NFI genes in vertebrates have an N-terminal DNA-binding domain encoded by a single exon, the
nfi-1 gene has four introns embedded in this conserved domain. To assess the DNA-binding properties of C. elegans NFI protein (CeNFI), we expressed cDNAs from the putative DNA-binding domain in E. coli and partially purified the protein using an N-terminal His6 tag and Ni-affinity chromatography. In vitro, CeNFI bound to the same well-defined NFI binding site as does human NFI-C as measured in a gel mobility shift assay. Also, a single point mutation in the site that abolished NFI-C binding also abolished CeNFI binding. To measure CeNFI expression in vivo, we created an
nfi-1-lacZ translational fusion vector and assessed lacZ activity in transgenic worms. CeNFI-lacZ activity was detected in specific cells only during embryogenesis in C. elegans with no expression seen in larva or adult hermaphrodites. Analysis of endogenous
nfi-1 mRNA by in situ hybridization shows that it is present from the one-cell to the 1.5-fold stage, is lost at later stages, but reappears in adult gonads. These findings are in contrast to our observation that all four NFI genes in the mouse are expressed in most adult tissues. We are currently assessing the functional significance of this phylogenetic difference in the expression of this highly conserved transcription/replication protein.