Broad Institute
Freely available data, software and tools for genomic analysis (archived link page; some links may no longer work).
California Biobank Program
The California Biobank Program (CBP) represents the combined biospecimen and data resources of two California Department of Public Health (CDPH) screening and monitoring programs, the California Birth Defects Monitoring Program (CBDMP) and the California Genetic Disease Screening Program (GDSP). GDSP administers both the Newborn Screening Program (NBS) and the Prenatal Screening Program (PNS).
DDBJ (DNA Data Bank of Japan)
A public database of annotated nucleotide sequences. Includes the Japanese Genotype-phenotype Archive (JGA), personal genotype and phenotype data from individuals who have signed consent agreements authorizing data release only for specific research uses.
EMBL-EBI (European Bioinformatics Institute)
A collection of freely available tools and data resources including AlphaFold (protein structures), BioModels (computational models), ChEMBL (bioactive compounds), Ensembl (genome browser), Expression Atlas (gene expression), GWAS Catalog (genome-wide association studies), Protein Data Bank (3D structures), and UniProt (protein sequences).
ExPASy
Portal of the SIB Swiss Institute of Bioinformatics to databases and software tools in proteomics, genomics, phylogeny, systems biology, evolution, population genetics, and transcriptomics. Formerly the Expert Protein Analysis System.
GDC (Genomic Data Commons of the National Cancer Institute)
NCI-generated data from cancer genomic datasets, including The Cancer Genome Atlas (TCGA) and Therapeutically Applicable Research to Generate Effective Therapies (TARGET), supporting the elucidation of the molecular basis of cancer.
miRBase
A database of published miRNA sequences and annotations for hundreds of different species.
NCBI (National Center for Biotechnology Information)
Includes Clinically Relevant Variants (ClinVar), Database of Genotypes and Phenotypes (dbGaP), Database of Human Genetic Variation (dbVar), Gene Expression Omnibus (GEO), Genetic Testing Registry (GTR), MedGen, Online Mendelian Inheritance in Man (OMIM), and Single Nucleotide Polymorphisms (SNP).
UCSC Xena
Allows users to explore functional genomic data sets for correlations between genomic and/or phenotypic variables. The next generation of the UCSC Cancer Genomics Browser.
VectorBase
Provides genomic, phenotypic and population-centric data for invertebrate vectors of human pathogens. Sponsored by NIAID-BRC (National Institute of Allergy and Infectious Diseases Bioinformatics Resource Center).
Online Bioinformatics Resources
BDGP (Berkeley Drosophila Genome Project)
A consortium working to sequence the euchromatic genome of Drosophila melanogaster and to generate and maintain biological annotations of this sequence.
Broad Institute
Freely available data, software and tools for genomic analysis (archived link page; some links may no longer work).
California Biobank Program
The California Biobank Program (CBP) represents the combined biospecimen and data resources of two California Department of Public Health (CDPH) screening and monitoring programs, the California Birth Defects Monitoring Program (CBDMP) and the California Genetic Disease Screening Program (GDSP). GDSP administers both the Newborn Screening Program (NBS) and the Prenatal Screening Program (PNS).
DDBJ (DNA Data Bank of Japan)
A public database of annotated nucleotide sequences. Includes the Japanese Genotype-phenotype Archive (JGA), personal genotype and phenotype data from individuals who have signed consent agreements authorizing data release only for specific research uses.
EcoCyc
A database for the bacterium Escherichia coli K-12 MG1655. The EcoCyc project performs literature-based curation of the entire genome, and of transcriptional regulation, transporters, and metabolic pathways.
EMBL-EBI (European Bioinformatics Institute)
A collection of freely available tools and data resources including AlphaFold (protein structures), BioModels (computational models), ChEMBL (bioactive compounds), Ensembl (genome browser), Expression Atlas (gene expression), GWAS Catalog (genome-wide association studies), Protein Data Bank (3D structures), and UniProt (protein sequences).
ExPASy
Portal of the SIB Swiss Institute of Bioinformatics to databases and software tools in proteomics, genomics, phylogeny, systems biology, evolution, population genetics, and transcriptomics. Formerly the Expert Protein Analysis System.
GDC (Genomic Data Commons of the National Cancer Institute)
NCI-generated data from cancer genomic datasets, including The Cancer Genome Atlas (TCGA) and Therapeutically Applicable Research to Generate Effective Therapies (TARGET), supporting the elucidation of the molecular basis of cancer.
Hymenoptera Genome Database (HGD)
Genome sequence data for insects of the order Hymenoptera (e.g. bees, wasps, ants). HGD integrates with the data mining tool HymenopteraMine.
JGI (Joint Genome Institute)
Tools for the analysis and functional characterization of publicly available genomes of plants, fungi, and microbes, as well as environmental metagenomes.
MetaCyc
A curated database of experimentally elucidated metabolic pathways from thousands of different organisms from all domains of life.
MGI (Mouse Genome Informatics)
Genetic, genomic, and biological data to facilitate the study of the laboratory mouse as a model organism for human health and disease.
miRBase
A database of published miRNA sequences and annotations for hundreds of different species.
PomBase
Structural and functional annotation, literature curation and large-scale data sets for the fission yeast Schizosaccharomyces pombe.
Public Library of Science PLOS
PLOS is a nonprofit, Open Access publisher empowering researchers to accelerate progress in science and medicine by leading a transformation in research communication.
PubMed
PubMed® comprises more than 34 million citations for biomedical literature from MEDLINE, life science journals, and online books. Citations may include links to full text content from PubMed Central and publisher web sites.
SGD (Saccharomyces Genome Database)
Integrated biological resources for the budding yeast Saccharomyces cerevisiae along with search and analysis tools.
UCSC Xena
Allows users to explore functional genomic data sets for correlations between genomic and/or phenotypic variables. The next generation of the UCSC Cancer Genomics Browser.
VectorBase
Provides genomic, phenotypic and population-centric data for invertebrate vectors of human pathogens. Sponsored by NIAID-BRC (National Institute of Allergy and Infectious Diseases Bioinformatics Resource Center).
WormBase
A resource on the genetics, genomics and biology of C. elegans and related nematodes.
XenBase
Genomic, expression and functional data for the model organisms Xenopus laevis and X. tropicalis.
ZFIN (Zebrafish Information Network)
The zebrafish model organism database, including reference information on zebrafish genetics, genomics and development
Bioinformatic Podcasts
The Bioinformatics Chat
The bioinformatics chat is a podcast about computational biology, bioinformatics, and next generation sequencing.
The bioinformatics chat is produced and hosted by Roman Cheplyaka. Several awesome machine learning-themed episodes have been hosted by Jacob Schreiber.
The Bioinformatics CRO Podcast
On The Bioinformatics CRO Podcast, we chat with scientists and others in biotech to discuss interesting topics across biomedical research and to explore what made them who they are today.
Our guests include biotech CEOs, science communicators, academic researchers, and more.
The Data Pulse Podcast
Dive into the growing role that data science plays in the latest biomedical innovations. I'm your host, Anika Gupta, a PhD student in Bioinformatics at Harvard and the Broad Institute. Join me for ~30 minutes each week as I go behind the scenes and check the pulse with domain experts and rising stars who are leading advances in data-driven human health.
MicroBinfie Podcast
Microbial Bioinformatics is a rapidly changing field marrying computer science and microbiology. Join us as we share some tips and tricks we’ve learnt over the years. If you’re student just getting to grips to the field, or someone who just wants to keep tabs on the latest and greatest - this podcast is for you.
PhenoTips Speakers Series
The PhenoTips Speaker Series is a live webinar and podcast series featuring genomics thought leaders in genetic counseling, bioinformatics, and genomics.
Bioinformatics OER (Open Educational Resources)
LibreTexts - Biology
This Living Library is a principal hub of the LibreTexts project, which is a multi-institutional collaborative venture to develop the next generation of open-access texts to improve postsecondary education at all levels of higher learning. The LibreTexts approach is highly collaborative where an Open Access textbook environment is under constant revision by students, faculty, and outside experts to supplant conventional paper-based books.
Open Textbooks - Biology
The Open Textbook Library was started so that faculty could find open textbooks in one place. More technically, the Open Textbook Library is a comprehensive referatory that points to open textbooks by a variety of authors and publishers.