TFmapper

The main purpose of TFmapper is to search all experimental ChIP-seq datasets and identify the trans-acting factors or histone modifications that show peaks at a gene of interest, or within a specified genomic region, in a defined biological sample.

The workflow diagram of TFmapper.

Peak files in the BED (Browser Extensible Data) format are downloaded from GEO/Cistrome and ENCODE. Peaks are annotated to genomic features (Promoter, TTS, 5’UTR, 3’UTR, Intron, Exon, Intergenic) using the software HOMER, with GRCh38/hg38 for human and GRCm38/mm10 for mouse as the reference genomes. The annotation results are stored in a MySQL database. To speed up query processing, the peaks are split by species, source, factor, and chromosome, so that each table holds a few million rows on average. On the client side, all the elements on the HTML page are built with R (Shiny), and result tables are created with the DataTables JavaScript library. Results can be downloaded in the CSV or BED format, and peaks can be visualized directly in the WashU Epigenome Browser or the UCSC Genome Browser.
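The partitioning idea described above can be sketched as follows. This is a minimal illustration, not TFmapper's actual schema: the table-naming scheme, the column names (`sample_id`, `peak_start`, `peak_end`, `feature`) and the helper functions are all hypothetical, and an in-memory SQLite database stands in for MySQL so that the example is self-contained.

```python
import sqlite3


def table_name(species, source, factor, chrom):
    """Build a hypothetical per-partition table name, e.g. human_ENCODE_CTCF_chr1."""
    parts = [species, source, factor, chrom]
    # Keep only alphanumeric characters so the name is safe to embed in SQL.
    cleaned = ["".join(ch for ch in part if ch.isalnum()) for part in parts]
    return "_".join(cleaned)


def create_partition(conn, name):
    """Create one small table per (species, source, factor, chromosome)."""
    conn.execute(
        f'CREATE TABLE IF NOT EXISTS "{name}" '
        "(sample_id TEXT, peak_start INTEGER, peak_end INTEGER, feature TEXT)"
    )


def peaks_overlapping(conn, name, start, end):
    """Return peaks in one partition that overlap the half-open region [start, end)."""
    # Two intervals overlap when each one starts before the other ends.
    query = (
        f'SELECT sample_id, peak_start, peak_end, feature FROM "{name}" '
        "WHERE peak_start < ? AND peak_end > ? ORDER BY peak_start"
    )
    return conn.execute(query, (end, start)).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    name = table_name("human", "ENCODE", "CTCF", "chr1")
    create_partition(conn, name)
    # Made-up peaks, for illustration only.
    conn.executemany(
        f'INSERT INTO "{name}" VALUES (?, ?, ?, ?)',
        [
            ("sampleA", 100, 200, "Promoter"),
            ("sampleB", 500, 650, "Intron"),
            ("sampleC", 900, 950, "Intergenic"),
        ],
    )
    for row in peaks_overlapping(conn, name, 150, 600):
        print(row)
```

Because a query only ever touches the table for the requested species, source, factor and chromosome, each lookup scans a small fraction of the full peak collection.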

To search:

Users can query the database by 1) gene symbol or 2) genomic coordinates, given on GRCh38/hg38 for human or on GRCm38/mm10 for mouse.
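As an illustration of a coordinate query, the sketch below parses a region string into its parts before it would be used in a search. The `chrom:start-end` input format, the `Region` type and the `parse_region` helper are assumptions made for this example; the text accepted by the TFmapper form may differ.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    chrom: str
    start: int
    end: int


# Hypothetical input format: "chr1:1,000-2,000" (thousands separators optional).
_REGION = re.compile(r"^(chr[0-9A-Za-z_]+):([\d,]+)-([\d,]+)$")


def parse_region(text):
    """Parse a 'chrom:start-end' string into a Region, or raise ValueError."""
    match = _REGION.match(text.strip())
    if match is None:
        raise ValueError(f"not a chrom:start-end region: {text!r}")
    chrom = match.group(1)
    # Drop thousands separators before converting to integers.
    start = int(match.group(2).replace(",", ""))
    end = int(match.group(3).replace(",", ""))
    if start >= end:
        raise ValueError("start must be smaller than end")
    return Region(chrom, start, end)


if __name__ == "__main__":
    print(parse_region("chr1:1,000-2,000"))
```

Validating the input up front means that a malformed region is rejected before any database table is touched.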

To visualize multiple peaks:

When multiple peaks are selected, a link for visualization in the WashU Epigenome Browser will appear (red arrow 1).

Advanced Searching

The results can be further filtered by typing in the boxes (figure 1, red arrow 2). When the box under ‘Distance’ is clicked, a slider appears that can be moved to narrow down the region.
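The same kind of filtering can be reproduced offline on a downloaded result table. The sketch below is only an approximation of what the web interface does: it assumes that text boxes perform a case-insensitive substring match and that the slider keeps rows inside a numeric range, and the column names (`factor`, `biological_source`, `distance`) are hypothetical.

```python
def filter_rows(rows, text_filters=None, distance_range=None):
    """Filter result rows (a list of dicts) by column text and a distance range.

    text_filters maps a column name to a search string; a row is kept only if
    every search string occurs in the matching column (case-insensitive).
    distance_range is an optional (low, high) pair, both ends inclusive.
    """
    text_filters = text_filters or {}
    kept = []
    for row in rows:
        # Every text box must match, as with per-column search fields.
        matches_text = all(
            needle.lower() in str(row[column]).lower()
            for column, needle in text_filters.items()
        )
        if not matches_text:
            continue
        if distance_range is not None:
            low, high = distance_range
            if not low <= row["distance"] <= high:
                continue
        kept.append(row)
    return kept


if __name__ == "__main__":
    # Made-up rows, for illustration only.
    rows = [
        {"factor": "CTCF", "biological_source": "K562", "distance": -250},
        {"factor": "CTCF", "biological_source": "HepG2", "distance": 4800},
        {"factor": "EP300", "biological_source": "K562", "distance": 120},
    ]
    print(filter_rows(rows, {"factor": "ctcf"}, (-1000, 1000)))
```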

About Section:

Next-generation sequencing coupled to chromatin immunoprecipitation (ChIP-seq), DNase I hypersensitivity (DNase-seq) and the transposase-accessible chromatin assay (ATAC-seq) has generated enormous amounts of data and markedly improved our understanding of the transcriptional and epigenetic control of gene expression. To take advantage of these datasets and provide clues about which factors, including transcription factors, epigenetic regulators and histone modifications, potentially regulate the expression of a gene of interest, a tool is needed that can query multiple datasets simultaneously, using gene symbols or genomic coordinates as search terms.

TFmapper

TFmapper allows users to search across thousands of ChIP-seq datasets generated by the ENCODE project, or ChIP-seq/DNase-seq/ATAC-seq datasets deposited in the Gene Expression Omnibus (GEO) and curated by the Cistrome project, to find factors regulating the expression of a gene of interest (GOI).

How to cite TFmapper

Zeng J, Li G. TFmapper: A Tool for Searching Putative Factors Regulating Gene Expression Using ChIP-seq Data. Int J Biol Sci. 2018 Sep 7;14(12):1724-1731.

Contact Us

If you have any questions about TFmapper, please contact us at jmzeng1314@163.com or gangli@umac.mo.

Acknowledgment:

TFmapper was built using the ENCODE datasets and the GEO datasets curated by CistromeDB.