| Home | About WormBook | Author Instructions | Sponsors | e-Alerts | Feedback | WormAtlas | WormBase |
Table of Contents
Abstract
The regulation of transcription in C. elegans shares many similarities to transcription in other organisms. The details of how specific transcription factors bind to target promoters and act as either activators or repressors are still being examined in many cases, but an increasing number of factors and their binding sites are being characterized. This chapter reviews the general concepts that have emerged with regards to promoter function in C. elegans. Included are the methods that have been successfully employed as well as limitations encountered to date. Specific cis-acting promoter elements from myo-2, hlh-1 and lin-26 are discussed as examples of complex promoters regulated by multiple sequence elements. In addition, examples of organ-, tissue-, and cell type-specific mechanisms for generating spatial specificity in gene expression are discussed.
Regulation of Polymerase II (Pol II) transcription in C. elegans can be described as typical for eukaryotes. Pol II appears to act in concert with TATA Binding Protein (TBP) and TBP-Associated Factors (TAFs) at the promoter of protein coding genes (Dantonel et al., 2000; Kaltenbach et al., 2000; Lichtsteiner and Tjian, 1993; Walker et al., 2004). Active Pol II is phosphorylated on the C-terminal domain (CTD) at serine 2 and 5 like other eukaryotes (Seydoux and Dunn, 1997; Wallenfang and Seydoux, 2002; Zhang et al., 2003). The functions of these proteins at the core of transcription are beginning to be defined and are reviewed in Transcription mechanisms. However, there are many things about transcription in C. elegans that we do not yet know for sure. For example, putative TATA and CAT boxes upstream of the coding region are often described, but this is largely done subjectively without any firm experimental evidence for the function of these elements. We also have not fully explored the role of histone modifications and chromatin organization in somatic cell transcription. Progress on these fronts has primarily been made in the areas of dosage compensation (see X-chromosome dosage compensation) and germline chromatin organization (see Germline chromatin). For these cases, the evolutionary conservation suggests that somatic cell transcription will similarly be influenced by typical eucaryotic mechanisms of chromatin organization. In many ways then, our understanding of transcription in C. elegans is still in its infancy, reflecting the fact that C. elegans, as a model biological system, is still a growing field that has primarily been exploited for its genetics.
The purpose of this chapter is to provide an overview of transcriptional regulation in C. elegans. It is geared towards an audience that is naïve in the ways of C. elegans gene regulation, but, it also includes information that should be helpful even to the seasoned veteran. The tools for studying transcription in C. elegans will be described in an effort to illustrate successful approaches and highlight techniques that, while useful in other systems, are challenging in the nematode. A review of the general trends in regulatory elements is followed by specific examples of spatial and temporal regulatory strategies. The hope is that this information will serve as both a useful review and an entry point into literature appropriate for specific applications.
Reporter genes are the most commonly used method to study transcriptional regulation in C. elegans. It is straightforward to generate transgenic lines (see Transformation and microinjection), and, as C. elegans is transparent throughout its life, it is easy to visualize reporter gene expression in all cells. Early studies of gene expression relied on lacZ reporter genes and were aided by the development of a set of vectors by the Fire lab (Fire et al., 1990). These lacZ reporters were very useful for determining cis-acting transcriptional control elements (Fire and Waterston, 1989; MacMorris et al., 1994; Okkema et al., 1993) and lacZ continues to be a robust marker that can be pushed to detect even low levels of expression (Wilkinson and Greenwald, 1995).
More recently, Green Fluorescent Protein (GFP), or one of its variants, serves as the common reporter. A GFP coding cassette can be inserted in different locations within a large genomic clone (tens of kilobases) to generate transcriptional and/or translational fusions. These constructs provide the greatest chance of capturing required cis-acting regulatory elements. However, it is more common to make the assumption that genomic sequences 5′ to the coding region represent the core promoter. This region, usually a few kilobases in length, can be PCR amplified and easily cloned into the reporter gene backbone of choice. Alternatively, the Promoterome Project can serve as a source for many promoter regions and is useful if the reporter gene cloning strategy is Gateway (Invitrogen) compatible (Dupuy et al., 2004). These constructs are commercially available through Open Biosytems.
There are several considerations to take into account when making reporter genes. One is the distinction between transcriptional and translational reporters; often one would like to have both. For transcriptional reporters, expression can be engineered to highlight the cytoplasm, nucleus or other cellular compartments in the expressing cell. Nuclear localized reporters are useful for embryonic cell identification whereas cytoplasmic reporters are often more useful for larval cells, particularly in neurons where they highlight the axonal and dendritic tracks. The Chalfie lab is developing a two-part fluorescent expression system that has the potential to simplify cell type identification (Zhang et al., 2004). Translational reporter genes can provide information on the subcellular localization of the endogenous gene product. In this case, it is advisable to use the fusion protein to rescue a mutant phenotype, thus demonstrating that some or all aspects of the expression pattern and subcellular localization are biologically relevant.
Once a pattern of expression is determined, promoter analyses can be used to home in on important regulatory elements. Sequential deletions of putative promoter regions linked to a GFP reporter gene are easily made by traditional cloning or Splicing by Overlap Extension (SOEing) Polymerase Chain Reaction (PCR) amplification (Horton et al., 1990). The latter technique allows high throughput and convenience as PCR reactions can be injected directly into animals without purification or cloning (Hobert, 2002). As the control elements become localized to small genomic regions (several hundred base pairs or less), they can be placed upstream of “basal” promoters to assay for enhancer activity. These approaches are common in the C. elegans literature and can be very successful in defining important cis-acting regulatory sequences.
The use of reporter genes has several important caveats. First and foremost is that they are artificial and can easily misrepresent the pattern of gene expression. Important positive- and negative-acting control elements can be excluded by assuming promoter location leading to mosaicism, loss of expression, or ectopic expression (Krause et al., 1994). For example, reporter genes that lack sufficient control elements for fidelity are often expressed in the anterior and posterior intestinal cells or a small set of head neurons. Moreover, small changes in promoter regions can dramatically alter expression patterns as illustrated by the studies of the ges-1 gene (Egan et al., 1995). It is critical to confirm reporter gene expression patterns with an independent technique such as in situ hybridization, antibody staining, or mutant phenotype.
A second concern of reporter genes is the nature of any additional modules in the construct or in a co-injected marker that may have an effect on expression. For example, many of the standard constructs available from the Fire Lab have a 3′ untranslated region (UTR) derived from the unc-54 gene encoding muscle myosin heavy chain. These sequences may not be neutral when combined with promoters from other cell types. There are also reports of dramatic effects of co-transformation markers on expression levels suggesting that you should not rely on a single co-transformation marker when exploring the expression of a novel gene (Fukushige and Siddiqui, 1995). Similarly, “basal” promoters (e.g., pes-10) may be biased in working with certain types of genomic elements. This may cause them to fail to respond in certain cell types resulting in false information about a particular enhancer element (Natarajan et al., 2004). As long as you keep these limitations in mind, reporter genes can be very helpful in characterizing transcriptional control elements for the gene of interest.
Genome-wide approaches provide a more global assessment of transcriptional regulation and have begun to become more common in C. elegans. The availability of both spotted cDNA (see Kim Lab) and oligonucleotide microarrays (e.g., Affymetrix) for C. elegans has given birth to a large amount of gene expression data in response to tissue type (e.g., germline-enriched; Reinke et al., 2004), growth conditions (e.g., dauer; Liu et al., 2004), or mutant background (e.g., DAF-16; McElwee et al., 2003). Much of this data is available on web sites (e.g., http://genome-www5.stanford.edu/cgi-bin/login.pl) or is linked in Wormbase to individual genes. A second global approach for gene expression profiling is Serial Analysis of Gene Expression (SAGE) that has recently been combined with tissue isolation or cell type sorting (McKay et al., 2003; http://elegans.bcgsc.ca/home/sage.html). These approaches give an overview of expression. For specific genes, this data should be validated by an independent method, such as reverse transcriptase (RT)-PCR or reporter genes.
Bioinformatics provides another way to study gene regulation, either alone or in combination with other methods. Currently, the genome sequence of two Caenorhabditis species are finished (elegans and briggsae), one is in draft (remanei), and two are planned (japonica and CB5161). Interspecific comparisons of non-coding regions provides a powerful tool in identifying important cis-acting regulatory elements controlling gene expression, as functional elements will remain constrained through evolution. Comparisons between C. elegans and C. briggsae revealed important cis-acting sequences controlling the vitellogenin genes and helped to identify GATA-type transcription factors as likely regulators (MacMorris et al., 1994; Spieth et al., 1991; Winter et al., 1996; Zucker-Aprison and Blumenthal, 1989). Such comparison continue to provide valuable information about cis-acting sequences within gene promoter regions with many examples in the literature (Culetto et al., 1999; Kirouac and Sternberg, 2003; Marshall and McGhee, 2001; Natarajan et al., 2004; Teng et al., 2004). The power of these comparisons is increased as the number of species is increased and will thus become more informative as sequences of additional species are finished. Recently developed programs, such as FamilyJewels, provide methods for sophisticated multiple alignments (Brown et al., 2002). This approach will become widely exploited in coming years to pinpoint regulatory promoter elements.
Bioinformatic analysis of known transcription factor binding sites upstream of coding regions has also been successful. Given a known binding consensus site of sufficient length, the CisOrtho program can be used to ferret out a list of potential genes sharing expression patterns (Bigelow et al., 2004). Bioinformatic comparisons of promoters from genes with the same or overlapping expression patterns can also be informative to home in on potential regulatory elements (for example, see Chang et al., 2004; Guhathakurta et al., 2004). A nice combination of bioinformatics and in vitro studies used the DNA binding properties of DAF-12, a regulator of dauer development and lifespan, to define potential binding sites and gene targets (Shostak et al., 2004). Regardless of the method used, candidate elements and gene targets should be validated experimentally by an independent means.
There are also several techniques for studying gene expression that, while commonplace in other organisms, are not routinely used in the worm. For example, one would ideally isolate pure populations of cells and tease apart transcriptional regulation at a biochemical level. At only 1mm in length as an adult, C. elegans makes tissue dissections tedious or impossible for generating enough homogeneous tissue for biochemical analysis. The recent development of cell culture techniques (see Methods in Cell biology), coupled with cell sorting, may make biochemical approaches more feasible in the future. However, the technique is still challenging enough that most researchers have opted for other methods to study transcription.
In situ hybridization is another common technique for cataloging transcriptional profiles in many organisms but it is less often used in C. elegans studies. The impermeable egg shell of C. elegans embryos and the cuticle of larvae and adults often lead to background hybridization or partially permeabilized animals, making it difficult to get in situ hybridization signals that are reproducible or trustworthy. Despite these difficulties, a genome-scale effort to catalog gene transcription profiles using in situ hybridization by the Kohara group is now underway. Their protocols and data are useful and can be accessed at http://nematode.lab.nig.ac.jp/db2/index.php.
The majority of protein coding genes in C. elegans are within gene-dense regions of the genome. Consequently, cis-acting regulatory regions are usually close to the coding region. The minimal promoter region required for proper expression of most Pol II transcripts lies within a couple of kilobases upstream of the start codon. There are notable exceptions to this compact view of cis-acting sequences. For example, egl-1 expression is controlled, in part, by an element located greater than 2 kb downstream of the coding region and beyond an unrelated, intervening gene (Thellmann et al., 2003). For lin-39, proper reporter gene expression required inclusion of ~30 kb of genomic DNA that extended upstream and downstream of the protein coding region (Wigmaister and Eisenmann, personal communication). Clearly C. elegans genes can have complex and distant control regions. However, a rule-of-thumb of 2 kb upstream of the ATG works well as a starting point in the search for cis-acting control elements.
It is important to remember that the minimal promoter region is not synonymous with the natural promoter. The natural promoter may span a much larger region due to redundancy in the function of regulatory elements that ensure proper and robust regulation of the endogenous gene. One common site of additional control elements is within introns. Most C. elegans introns are small (e.g., <100 bp; see Alternative splicing in C. elegans) and are thus unlikely to contain elements controlling expression. However, introns larger than several hundred base pairs do often have such elements (e.g., Nam et al., 2002; Okkema et al., 1993). Therefore, intron size can provide a clue in searching for transcriptional control sequences. Large introns, particularly at the beginning of a coding region, may also provide a clue to promoter organization and the presence of multiple transcriptional initiation sites. For example, nhr-23 has a 1.8 kb intron at the start of the gene that is included in one transcript and absent in a second (Kostrouchova et al., 1998). In cases such as this, the presence of a trans-spliced leader (see Trans-splicing and operons) on two or more different transcripts from a single gene can be an indicator of multiple messages, possibly encoding different protein isoforms.
A simple promoter is defined here as one in which the cis-acting control elements necessary for proper expression are confined to a small region (a few hundreds of bp) of the genome. Housekeeping genes expressed in all tissues might be good candidates for regulation by simple promoters, Unfortunately, few housekeeping genes in C. elegans have been characterized. Among the best characterized simple promoters are those of the hsp-16 family of genes. This family consists of pairs of divergently transcribed genes with promoter regions sufficient for heat-regulated expression contained within the short (~350 bp) intragenic regions (Jones et al., 1986; Russnak and Candido, 1985; Stringham et al., 1992). Despite these compact promoters, distinct tissue expression patterns are induced from different hsp-16 promoters (Stringham et al., 1992), suggesting the presence of multiple regulatory sites within these simple promoters. Another excellent example of simple promoters are in the vitellogenin (vit) genes, which exhibit stage-, tissue- and sex-specific expression controlled, in the case of vit-2, by a 247 bp promoter (MacMorris et al., 1992; MacMorris et al., 1994). vit-2 promoter activity depends on GATA-factor binding sites and a novel VPE2 site (TGTCAAT) conserved in vit gene promoters in C. elegans and C. briggsae (Spieth et al., 1985; Zucker-Aprison and Blumenthal, 1989). Certain cell cycle promoters have also been shown to be remarkably simple. Analysis of several genes expressed only in proliferative cells and encoding G1 phase regulators (e.g., cyclin D) revealed that proper regulation minimally required a 67 bp region of the promoter (Brodigan et al., 2003; Park and Krause, 1999). How could genes with such dynamic expression profiles throughout development be regulated in an apparently simple way? The answer is likely that they are end effectors of a cell's decision to divide rather than integrating lineage or temporal information governing proliferation.
The term complex is used here to describe a promoter in which the overall pattern of gene expression is the result of the composite action of several dispersed elements, each influencing or contributing to the overall expression pattern. This piecemeal organization has been described for the promoter region of several genes, including myo-2, hlh-1 and lin-26. These studies reveal examples in which spatial control of transcription is regulated by elements active in groups of cells related by cell-, tissue- and organ-type and by lineage history.
myo-2 encodes a myosin heavy chain expressed exclusively in the pharyngeal muscles as these cells undergo terminal differentiation (Ardizzi and Epstein, 1987; Miller et al., 1983). Characterization of the myo-2 promoter region in transgenic C. elegans and identification of trans-acting regulators indicates expression is regulated by a combination of organ- and cell type-specific signals targeting distinct regulatory sequences.
High level activity of the myo-2 promoter requires a transcriptional enhancer located approximately 300 bp upstream of the transcriptional start (Okkema et al., 1993). The intact myo-2 enhancer is active exclusively in the pharyngeal muscles, but, surprisingly, its activity depends on distinct cell-type-specific and organ-specific subelements, termed B and C, that can separately activate gene expression either specifically in the pharyngeal muscles, or more globally in all pharyngeal cell types (Okkema and Fire, 1994). In their endogenous context within the myo-2 gene, these subelements synergistically activate pharyngeal muscle gene expression.
Consistent with their distinct activities, the B and C subelements are targeted by transcription factors expressed in different spatial patterns in the pharynx (Figure 1). The cell-type-specific B subelement binds and is activated by the pharyngeal muscle specific NK-2 family homeodomain factor CEH-22 (Okkema and Fire, 1994; Okkema et al., 1997), which is structurally and functionally related to factors controlling cardiac muscle development in other species (Haun et al., 1998). The organ-specific C subelement binds and is activated by the pan-pharyngeal FoxA family transcription factor PHA-4 (Kalb et al., 1998), which is required for formation of pharyngeal muscle and all other pharyngeal cell types during embryonic development (see below).
CEH-22 is not the only factor functioning with PHA-4 to activate myo-2 expression. CEH-22 is expressed in most, but not all, myo-2 expressing pharyngeal muscles (Okkema and Fire, 1994). Likewise a ceh-22 mutant expresses myo-2, although these animals exhibit defects in B subelement activity and pharyngeal muscle development and function (Okkema et al., 1997). Thus, other as yet unidentified factors must contribute to myo-2 expression, and the characterization of these factors will enhance our understanding of pharyngeal muscle development.
Figure 1. CEH-22 and PHA-4 function in combination to activate pharyngeal muscle expression of myo-2. myo-2 expression is activated by the pharyngeal muscle-specific CEH-22 and the pan-pharyngeal PHA-4, which bind the myo-2 enhancer B and C subelements, respectively. Micrographs indicate trangenic embryos expressing ceh-22::gfp in pharyngeal muscles and pha-4::gfp in all pharyngeal cells (top, delimited by arrowheads), and a transgenic adult expressing myo-2::lacZ in the pharyngeal muscles (bottom). Note pha-4::gfp is also expressed in the gut. GFP and β-galactosidase are targeted to nuclei to facilitate cell identification.
hlh-1 encodes a basic helix-loop-helix transcription factor expressed in all body wall muscle cells and their precursors (Krause et al., 1990). The body wall muscle cells are derived from multiple cell lineages. Of the 81 body wall muscle cells born during embryogenesis, 1 is from the AB lineage, 28 are from the MS lineage, 32 are from the C lineage and 20 are from the D lineage (Sulston et al., 1983). An additional 14 body wall muscle cells (and other cell types) are born postembryonically from the M mesoblast (Sulston and Horvitz, 1977).
Dissection of the hlh-1 promoter shows that gene expression can be properly regulated by multiple elements spanning ~3 kb upstream of the ATG (Figure 2; Krause et al., 1994). A core element required for all expression resides just upstream of the ATG. In addition, there are several individual elements that drive expression preferentially in one or more lineages. However, no single element is specific for expression in just one lineage. In addition, the expression during embryogenesis is controlled by a different region than that controlling postembryonic expression. The overall pattern of hlh-1 expression is thus a composite of the action of several lineage-preference elements with overlapping domains of action, working in concert with an essential core element. Superimposed on this spatial pattern of regulation are distinct temporal control elements regulating timing of expression during development. As yet, no trans-acting factors have been identified that bind to the defined cis-acting elements, illustrating the difficulty in using promoter analysis alone to identify trans-acting factors.
Figure 2. Regulation of hlh-1 expression by lineage-preference elements. A) A schematic of body wall muscle nuclei is super-imposed on an image of a comma stage embryo. Each of four different lineages of origin is color-coded as shown in (B) (adapted from (Sulston et al., 1983). C) The promoter and partial coding region (exons 1 and 2) of hlh-1 are shown (adapted from (Krause et al., 1990). All expression is dependent on a “core” element (star) located upstream of the ATG of exon 1. Below the gene structure diagram are color-coded elements that can direct lineage-preference expression of transgenes during embryogenesis; color coding as in (A) and (B). Mature body wall muscle is dependent on distinct temporal elements (purple boxes) that do not have lineage preferences.
lin-26 encodes a predicted zinc-finger transcription factor expressed in a broad range of ectodermally derived epithelial tissues, the somatic gonad and uterus (Labouesse et al., 1996; Labouesse et al., 1994). Within these ectodermally-derived epithelial tissues are the major hypodermis surrounding the body of the animal, specialized hypodermal cells located at the anterior and posterior ends of the body, and interfacial cells such as rectal cells connecting the external epithelium to the endoderm. A recent characterization of the lin-26 promoter region revealed this gene is regulated by a core element required for all expression working in concert with tissue-specific elements, rather than lineage-preference elements as discussed above for hlh-1 (Landmann et al., 2004).
lin-26 is the downstream gene in an alternatively spliced operon including lir-1 (Dufourcq et al., 1999), and proper expression of lin-26 requires an 11 kb upstream region including most of the lir-1 gene itself (den Boer et al., 1998). Within this region are tissue specific regulatory modules that activate gene expression in subsets of lin-26 expressing tissues (Figure 3; Landmann et al., 2004). For example, separable modules control expression in the major hypodermal cells, in the minor hypodermal cells and sheath and socket support cells, in rectal cells, or in the somatic gonad. In some cases, redundant elements contribute to expression in particular tissues (e.g., major hypodermal cells), and, in the case of the minor hypodermis and support cells located at the worms anterior and posterior ends, separable elements active either in anterior or posterior ends were identified. Thus, the lin-26 promoter region contains cis-regulatory elements active in cells that belong to the same organ, are functionally related, or have similar positions along the body (Landmann et al., 2004), and these elements together produce the full lin-26 expression pattern in a piecemeal fashion.
Figure 3. Regulation of lin-26 by tissue-specific elements. A) A diagram of some of the cell types expressing lin-26 (adapted from (Landmann et al., 2004) is shown super-imposed on an image of a comma stage embryo. B) A partial embryonic lineage showing the origin of lin-26 expressing cells with color coding matching the cell types shown in (A)(adapted from (Landmann et al., 2004): major hypodermal cells include hyp 7 (green), seam cells (orange), and P cells (purple); support cells (red); somatic gonad precursors Z1 and Z4 (yellow). C) The promoter elements for lin-26. All expression is dependent on a “core” element (star) located in the intergenic region between lin-26 and its upstream neighbor lir-1. Tissue-specific control elements, located within a lir-1 intron, are shown below the gene structure diagram with color-coding as in (A) and (B). Most control elements function in cells related by tissue-type but not by lineage. Note also that temporal control is achieved by sequentially acting elements that are progressively further upstream from the ATG of lin-26.
One common theme to emerge from these three examples is redundancy of regulatory elements. In most cases, even when sub-elements are identified with specific tissue, lineage or organ activity, their loss does not prevent all expression in that region. Clearly endogenous gene regulation has evolved to include multiple and overlapping regulatory regions to ensure proper expression during development. The deconstruction of a promoter is most useful in showing a minimal set of cis-acting control elements. As studies employ more sophisticated techniques and assays, we may learn how extensive this redundancy is.
The completion of the C. elegans genome makes it possible, in theory, to define all transcription factors in the worm. In practice, this effort is more difficult because of several uncertainties when surveying the properties of a given gene. For example, zinc finger motifs can bind DNA but also can serve other functions including RNA binding and protein-protein interactions. It is therefore difficult to conclude that a given gene product is indeed a transcription factor based solely on the presence of signature motifs. For factors that modify chromatin or participate in a transcription complex, the definition of a transcription factor often lies in the eyes of the investigator. A first-pass attempt at defining a list of C. elegans transcription factors is presented in Table 1. Originally compiled in the Sternberg Lab (courtesy of T. Ririe and J. Fernandes), we present a modified version of their list with the understanding that it will necessarily need refinement over time to correct inaccuracies and omissions. The current list includes 664 genes representing only about 3.5% of the predicted genes in C. elegans. This number is surprisingly low and about one half the number of transcription factors estimated previously (McGhee and Krause, 1997).
Table 1. C. elegans transcription factors
| Gene | Affy probe set | Description |
|---|---|---|
| gfl-1 | 190531_at | AF-9-like |
| taf-11.3 | 188157_at | an ortholog of human TATA-binding protein associated factor TAF11 |
| R07C12.4 | 185116_s_at | AP-1-like |
| F28C6.1 | 191919_at | AP-2-like |
| F28C6.2 | 191940_at | AP-2-like |
| K06A1.1 | 191145_at | AP-2-like |
| Y62E10A.17 | 186925_at | AP-2-like |
| Y73E7A.2 | 176887_at | Apoptosis antagonizing transcription factor |
| aha-1 | 172967_x_at, 172057_x_at | bHLH |
| ahr-1 | 193149_at | bHLH |
| C15C8.2 | 192573_at | bHLH |
| cnd-1 | 187594_at | bHLH |
| F38C2.8 | 172921_x_at | bHLH |
| hif-1 | 183824_s_at | bHLH |
| hlh-1 | 193759_at | bHLH |
| hlh-10 | 189550_at | bHLH |
| hlh-11 | 171841_x_at | bHLH |
| hlh-12 | 186469_at | bHLH |
| hlh-13 | 182960_at | bHLH |
| hlh-14 | 192984_at | bHLH |
| hlh-15 | 182141_at | bHLH |
| hlh-16 | 192331_at | bHLH |
| hlh-17 | 172921_x_at | bHLH |
| hlh-19 | 193041_at | bHLH |
| hlh-2 | 193176_s_at | bHLH |
| hlh-21 | 183551_at | bHLH |
| hlh-25 | 184961_s_at | bHLH |
| hlh-26 | 183674_at | bHLH |
| hlh-27 | 184961_s_at | bHLH |
| hlh-28 | 179991_s_at | bHLH |
| hlh-29 | 179991_s_at | bHLH |
| hlh-3 | 192523_at | bHLH |
| hlh-4 | 190064_at | bHLH |
| hlh-6 | 193106_at | bHLH |
| hlh-8 | 193985_at | bHLH |
| hnd-1 | 192707_at | bHLH |
| lin-32 | 188671_at | bHLH |
| mdl-1 | 193723_s_at | bHLH |
| ngn-1 | 176272_at | bHLH |
| T01D3.2 | 190290_at | bHLH |
| W02C12.3 | 190299_at | bHLH |
| Y105C5B.29 | 172921_x_at | bHLH |
| Y39A3CR.6 | 186966_at | bHLH |
| mxl-1 | 193662_at | bHLH/ZIP |
| mxl-2 | 184737_s_at | bHLH/ZIP |
| mxl-3 | 192645_at | bHLH/ZIP |
| bra-2 | 172599_x_at | BMP receptor associated protein family |
| taf-3 | 177494_at | bromodomain |
| atf-2 | 189943_at | bZIP |
| atf-5 | 192027_s_at | bZIP |
| atf-6 | 188462_at | bZIP |
| atf-7 | 173892_s_at, 172407_x_at | bZIP |
| C27D6.4 | 188956_at, 187831_at, 176715_at, 193833_s_at | bZIP |
| C34D1.5 | 193844_at | bZIP |
| C48E7.11 | bZIP | |
| ces-2 | 193437_s_at | bZIP |
| crh-1 | 189423_s_at | bZIP |
| F17A9.3 | 191134_at | bZIP |
| F23F12.9 | 187591_at | bZIP |
| F23F12.9 | 171733_x_at | bZIP |
| F29G9.4 | 191088_at | bZIP |
| F57B10.1 | 185841_s_at | bZIP |
| K02F3.4 | 181230_at | bZIP |
| mgl-2 | 173929_s_at, 193356_s_at | bZIP |
| pha-1 | 188075_at | bZIP |
| R07H5.10 | 181517_at | bZIP |
| skn-1 | 188421_at | bZIP |
| srx-41 | 183336_at | bZIP |
| T24H10.7 | 180818_at, 188946_at | bZIP |
| T27F2.4 | 179204_s_at | bZIP |
| W02H5.7 | 182521_at | bZIP |
| W07G1.3 | 179163_at, 179658_at | bZIP |
| W08E12.1 | 184700_at | bZIP |
| xbp-1 | 190863_at | bZIP |
| Y75B8A.29 | 185348_at | bZIP |
| ZC376.7 | 189598_s_at | bZIP |
| ZC8.4a.1 | 191675_s_at | bZIP |
| zip-1 | 173457_s_at | bZIP |
| T05C1.4 | 179917_at, 182254_s_at | calmodulin-binding transcription activator (CAMTA) |
| cbp-1 | 173017_at, 191123_s_at | CBP/p300 homolog |
| lpd-2 | 182912_at | CCAAT-binding |
| T08D10.1 | 189859_s_at | CCAAT-binding |
| F23F1.1 | 174270_s_at | CCAAT-binding, subunit C (HAP5) |
| Y51H1A.5 | 182089_at | CCAAT-binding |
| F22F1.3 | 194040_at | KIX domain, coactivator CBP |
| lag-1 | 175617_at, 192000_s_at | CSL |
| cdk-8 | 190709_at | cyclin C interactor |
| pqn-45 | 177704_at | DEC-1-like |
| pqn-47 | 182066_s_at | DEC-1-like |
| mab-23 | DM DNA-binding | |
| mab-3 | 192765_at | DM DNA-binding |
| C27C12.6 | 173904_at | DM DNA-binding |
| Y53F4B.3 | 181571_s_at | DNA Polymerase epsilon, subunit C |
| F10C1.5 | 192453_at | Doublesex |
| dpl-1 | 191593_s_at | E2F/DP1 |
| F49E12.6 | 175552_at, 189678_at | E2F/DP1 |
| elf-1(mex-2) | 186476_s_at | E2F/DP1 |
| elf-2 | 186271_at | E2F/DP1 |
| C24A1.2 | 174325_at | ETS domain |
| C33A11.4 | 188664_at | ETS domain |
| C42D8.4 | 189894_at | ETS domain |
| C50A2.4 | 183564_at | ETS domain |
| C52B9.2 | 180922_at | ETS domain |
| F19F10.1 | 190163_at | ETS domain |
| F19F10.5 | 190230_at | ETS domain |
| F22A3.1 | 192249_at | ETS domain |
| lin-1 | 173446_s_at, 175607_s_at | ETS domain |
| T08H4.3 | 188791_s_at | ETS domain |
| Y73F8A.14 | ETS-related | |
| peb-1 | 188015_at | Zn-finger, FLYWCH |
| C34B4.2 | 192149_at | Forkhead |
| daf-16 | 188176_s_at, 181992_s_at | Forkhead |
| pes-1 | 188273_at | Forkhead |
| pha-4 | 193962_at | Forkhead |
| T27A8.2 | 189248_at | Forkhead |
| unc-130 | 190731_s_at | Forkhead |
| fkh-10 | 190179_at | Forkhead |
| fkh-2 | 193957_s_at, 174011_at | Forkhead |
| fkh-3 | 190724_s_at | Forkhead |
| fkh-4 | 172877_x_at | Forkhead |
| fkh-5 | 187411_at | Forkhead |
| fkh-6 | 191946_at | Forkhead |
| fkh-7 | 187837_at | Forkhead |
| fkh-8 | 181677_at | Forkhead |
| fkh-9 | 188781_at | Forkhead |
| let-381 | 175474_at | Forkhead |
| lin-31 | 188600_at | Forkhead |
| C18G1.2 | 191147_at, 183542_at, 174326_s_at | GATA |
| egl-18 | 190079_at | GATA |
| elt-1 | 192655_s_at | GATA |
| elt-2 | 193259_at | GATA |
| elt-3 | 193640_s_at | GATA |
| elt-4 | GATA | |
| elt-6 | 185110_at | GATA |
| elt-7 | GATA | |
| end-1 | 193618_at | GATA |
| end-3 | 193616_at | GATA |
| med-1 | 188376_at | GATA |
| med-2 | GATA | |
| F55A3.3 | 171809_s_at, 190137_s_at | global transcriptional regulator |
| F31F7.3 | 182888_at | Golden2-like |
| unc-37 | 173188_s_at, 193614_s_at | Groucho |
| egr-1 | 188149_s_at, 188166_s_at | HDAC, GATA-like) |
| K08B5.2 | 182979_at | heat shock |
| dro-1 | 192105_at | histone-like |
| W10D9.4 | 180226_s_at | histone-like |
| lin-22 | 175682_at | HLH |
| ref-1 | 193898_at | HLH |
| sbp-1 | 192735_s_at | HLH |
| hmg-1.1 | 175799_at | HMG box |
| hmg-1.2 | 175806_at | HMG box |
| hmg-11 | 193301_at | HMG box |
| hmg-12 | 188169_at | HMG box |
| hmg-3 | 193484_s_at | HMG box |
| hmg-4 | 188651_s_at | HMG box |
| hmg-5 | 187989_at | HMG box |
| pop-1 | 188002_at | HMG box |
| gei-3 | 178044_at, 192981_at | HMG-box |
| C02F12.5 | 192268_at | homeobox domain |
| C07E3.6 | 189406_at, 178029_at | homeobox domain |
| C09G12.1 | 189632_at | homeobox domain |
| C12D12.4 | 184383_at | homeobox domain |
| C12D12.5 | 184573_at | homeobox domain |
| C17H12.9 | 189669_at | homeobox domain |
| C18B12.3 | 189424_at | homeobox domain |
| C36F7.1 | 189594_at | homeobox domain |
| C49C3.5 | 189172_s_at | homeobox domain |
| ceh-1 | 176628_at | homeobox domain |
| ceh-10 | 176624_at | homeobox domain |
| ceh-12 | 192019_at | homeobox domain |
| ceh-13 | 176615_at | homeobox domain |
| ceh-14 | 192631_s_at | homeobox domain |
| ceh-16 | 188939_at | homeobox domain |
| ceh-17 | 191721_at | homeobox domain |
| ceh-19 | 192315_at | homeobox domain |
| ceh-2 | 191750_at | homeobox domain |
| ceh-22 | 175701_at, 193693_s_at | homeobox domain |
| ceh-23 | 189145_at | homeobox domain |
| ceh-24 | 193511_at | homeobox domain |
| ceh-26 | 176620_at | homeobox domain |
| ceh-27 | 192317_at | homeobox domain |
| ceh-28 | 193479_at | homeobox domain |
| ceh-30 | 189850_at | homeobox domain |
| ceh-31 | 189951_at | homeobox domain |
| ceh-32 | 193576_at | homeobox domain |
| ceh-33 | 188645_at | homeobox domain |
| ceh-34 | 192166_at | homeobox domain |
| ceh-35 | 192928_at | homeobox domain |
| ceh-36 | 173826_s_at | homeobox domain |
| ceh-37 | 192014_s_at | homeobox domain |
| ceh-40 | 192022_at | homeobox domain |
| ceh-41 | 193249_at | homeobox domain |
| ceh-43 | 193833_at | homeobox domain |
| ceh-5 | 191117_at | homeobox domain |
| ceh-6 | 194028_s_at | homeobox domain |
| ceh-7 | 191844_at | homeobox domain |
| ceh-8 | 193515_at | homeobox domain |
| ceh-9 | homeobox domain | |
| egl-5 | 175712_s_at | homeobox domain |
| F17A9.6 | 189546_s_at | homeobox domain |
| F22A3.5 | 190371_at | homeobox domain |
| F42G2.6 | 181119_s_at | homeobox domain |
| F45C12.15 | 186689_s_at | homeobox domain |
| K02F3.8 | 181123_at | homeobox domain |
| lin-39 | 189637_at | homeobox domain |
| M6.3 | 189007_at | homeobox domain |
| mab-18 | 194012_s_at | homeobox domain |
| mab-5 | 173387_at, 176579_s_at | homeobox domain |
| mls-2 | 173754_at, 189498_s_at, 182023_at | homeobox domain |
| nob-1 | 191468_s_at | homeobox domain |
| pal-1 | 193341_at, 193342_s_at | homeobox domain |
| pax-1 | 192418_at | homeobox domain |
| pax-3 | 193866_at | homeobox domain |