Natural variation in gene expression is a major source of phenotypic diversity among individuals. Genetic variants underlying gene expression differences are known as expression quantitative trait loci (eQTL). In Caenorhabditis elegans, both local eQTL (located close to the genes they influence) and distant eQTL (located farther away from the genes they influence) have been identified using linkage analysis from a panel of recombinant inbred advanced intercross lines (RIAILs) between the laboratory reference strain, N2, and a wild strain, CB4856. However, the eQTL detected using RIAILs were limited to genetic variants between N2 and CB4856. Here, we investigated the natural variation in gene expression of 205 genetically distinct C. elegans wild strains by performing RNA-sequencing on synchronized young adult hermaphrodites. We obtained reliable expression of 25,896 protein-coding transcripts (16,106 genes). We used genome-wide association (GWA) analysis to identify 3,342 local eQTL for 3,342 transcripts (2,777 genes), and 2,835 distant eQTL for 2,206 transcripts (2,082 genes). We found that most of the narrow-sense heritability for transcript expression variation is explained by detected eQTL. Of the 2,835 distant eQTL, 1,670 eQTL significantly clustered in 54 hotspots across the C. elegans genome. We will further explore causal genes underlying these distant eQTL hotspots and their functions. Additionally, we applied mediation analysis to the eQTL data and other organism-level quantitative traits to elucidate the genetic effects on phenotypic variation mediated by gene expression. For instance, instead of relying on time-consuming investigations of rare genetic variants missed in GWA studies and prior knowledge of the trait as published in our study of C. elegans responses to benzimidazoles (Hahnel et al. 2018), the significant mediating effect of the expression of
ben-1, the causal gene, was quickly and successfully identified by mediation analysis. Our results suggest that mediation analysis using expression data facilitates identification of causal genes in GWA studies in C. elegans.