[
Methods Mol Biol,
2011]
PhospoPep version 2.0 is a project to support systems biology signaling research by providing interactive interrogation of MS-derived phosphorylation data from four different organisms. Currently the database hosts phosphorylation data from the fly (Drosophila melanogaster), human (Homo sapiens), worm (Caenorhabditis elegans), and yeast (Saccharomyces cerevisiae). The following will give an overview of the content and usage of the PhosphoPep database.
[
Methods Cell Biol,
1995]
ACeDB (A Caenorhabditis elegans Data Base) is a data management and display system that contains a wide range of genomic and other information about C. elegans. This chapter provides an overview of ACeDB for the C. elegans user, focusing in particular on the Macintosh version Macace. Previous reviews of AceDB include those of Thierry-Mieg and Durbin (1992) and Durbin and Thierry-Mieg (1994), which describe the general properties of the whole system, and that by Dunham et al. (1994), which discussed the use of AceDB for physical map data collection and assembly. ACeDB was developed by Jean Thierry-Mieg and Richard Durbin primarily for the C. elegans project, when the genomic sequencing project was just beginning in 1990. The original aim was to create a single database that integrated the genetic and physical maps with both genomic sequence data and the literature references. The forerunner of ACeDB was the program CONTIG9 (Sulston et al., 1988), which was developed to maintain and edit the physical map. CONTIG9 served researchers around the world by providing critical on-line access to the current physical map as it was being constructed (Coulson et al., 1986). This policy of immediate access allowed members of the worm community to see the same data as the people making the map, and proved very successful in maximizing use of the map. The same approach was adopted as a template for ACeDB. These two principles, developing a comprehensive database for all types of genomic and related data and providing public access to the data in the same form as used by the data-collecting laboratories, have continued to underlie developments of ACeDB. Over the last 5 years, a wide range of genome projects relating to other organisms have taken the ACeDB program and used it to develop databases for their own data. ACeDB has been used both in public projects designed to redistribute public data in a coordinated fashion and laboratory-based projects for collecting new data. Others, such as the C. elegans ACeDB, have used the database for both purposes. The reason it has been possible to adapt ACeDB so widely is that its flexible data structure allows new types of objects and new types of information about these objects to be added easily. This chapter describes (1) how to obtain ACeDB and documentation for it, (2) how to access and use the information in ACeDB, and (3) how to use ACeDB as a laboratory-based data managing system. Some of what we discuss is specific to the nematode database, but other information applies to the basic computer software program and, hence, to any database using the ACeDB program.
[
WormBook,
2006]
The completion of the C. elegans genome sequence permits the comprehensive examination of the expression and function of genes. Annotation of virtually every encoded gene in the genome allows systematic analysis of those genes using high-throughput assays, such as microarrays and RNAi. This chapter will center on the use of microarrays to comprehensively identify genes with enriched expression in the germ line during development. This knowledge provides a database for further studies that focus on gene function during germline development or early embryogenesis. Additionally, a comprehensive overview of germline gene expression can uncover striking biases in how genes expressed in the germ line are distributed in the genome, leading to new discoveries of global regulatory mechanisms in the germ line.
[
Lecture Notes in Computer Science,
2005]
The OMA project is a large-scale effort to identify groups of orthologs from complete genome data, currently 150 species. The algorithm relies solely on protein sequence information and does not require any human supervision. It has several original features, in particular a verification step that detects paralogs and prevents them from being clustered together. Consistency checks and verification are performed throughout the process. The resulting groups, whenever a comparison could be made, are highly consistent both with EC assignments, and with assignments from the manually curated database HAMAP. A highly accurate set of orthologous sequences constitutes the basis for several other investigations, including phylogenetic analysis and protein classification.
[
1997]
Caenorhabditis elegans aquatic toxicity assays were standardized with five common reference toxicants: CdCl2, NaCl, KCl, sodium lauryl sulfate (SLS), and sodium pentachlorophenate (PCP). Aquatic toxicity testing was conducted in 3 media: a standard C. elegans medium; EPA moderately hard reconstituted water; and EPA moderately hard mineral water. Test duration in each medium was 24h without a food source, and 24h and 48h with Escherichia coli strain OP50 as a food source. Each test was replicated three times with each replicate having 6 wells per concentration, 10 worms per well. LC50 values were calculated using probit analysis. The average LC50s for each set of replicants were compared to assess sensitivity and reproducibility of the data, identifying expected variation between replicate tests. These reference toxicants increase the database for C. elegans and provide a benchmark for further application.