Biochemistry Research Resources and Links

Note: I would like to make this page a compendium of computational research-related resources! Please help me keep this information updated; email me if sites listed here are outdated, redundant, erroneously categorized, or if you want to share additional useful sites: zuritalopez (at) Thank you!

Find journal articles and more via PubMed

Science Librarian Tiffanie Ford-Baxter - Chemistry Library Guide:

Prepare a research report

Here’s an article I found that contains good tips on how to read papers:

Proper Reference Citations also see the articles section in Ch. 14 of the ACS style guide for a very detailed version, and a good summary. 

Reference citation through our library resources: Click on the How To tab on the library's main page and see Cite Sources

Brenda. The comprehensive enzyme information system.


Saccharomyces Genome Database (SGD) provides comprehensive integrated biological information for the budding yeast Saccharomyces cerevisiae along with search and analysis tools to explore these data, enabling the discovery of functional relationships between sequence and gene products in fungi and higher organisms.

Plant Genome Database (PGD)

TAIR Arabidopisis Information Resource

SAGA: Software for the Analysis of Genetic Architecture Explore genes. Find out how genes behave and where they are located. Note: requires software download

DNA and/or protein multiple sequence alignment programs

BLAST (Basic Local Alignment Search Tool) to find regions of similarity between DNA or proteins. The program compares sequences to sequence databases and calculates statistical signficiance. With BLAST, you can align two known sequences or compare your sequences to sequence databases

Pairwise Sequence Alignment to identify regions of similarity that may indicate functional, structural and/or evolutionary relationships between two sequences (protein or nucleic acid).

ClustalW2 a general purpose DNA or protein multiple sequence alignment program for three or more sequences.

Clustal Omega a multiple sequence alignment program that uses seeded guide trees and HMM profile-profile techniques to generate alignments between three or more sequences.

Gene Infinity Many different types of resources including nucleotide to protein conversions.

Benchling R&D cloud lab software with molecular biology tools and lab notebook capabilities.


Cuffdiff - find significant changes in transcript expression, splicing, and promoter use.

Where in the human body is your protein expressed? The Human Proteome Map, analyzed by totalt mass spectromtrey. Human Proteome Map

KEGG: Kyoto Encyclopedia of Genes and Genomes, a database resource for using molecular-level infomraiton (large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies) for understanding living systems.

Protein Structural Analysis


NextProt Exploring the univers of human proteins - includes links to many other useful databases and tools.

Protein Data Bank (PDB) a database of experimentally determined protein structures.

UniProt UniProt provides protein sequence and functional information.

COBALT a multiple protein sequence alignment using conserved domain and local sequence similarity information. Align sequences using domain and protein constraints.

ExPASy Logo   access to scientific databases and software tools including proteomics, genomics, etc.

The Cambridge Crystallographic Data Centre (CCDC) Comprehensive database of crystal structures (not proteins). 

ChemDraw (use Cal State LA site license):

ImageJ, an open source image processing program: or


Protein Motifs (Consensus Sequences and Structural Motifs), Cleaving Sites and Metal Binding

MOTIF Search


ScanProsite tool

Cleave Predict PMAP protease specificity database


Degradome (info on all main proteases)

Cysteine Oxidation Prediction Program (COPP):

Does a metal bind to your protein? CheckMyMetal (CMM): Metal Binding Site Validation Server:

Protein Post-Translational Modification Prediction

A post-translational modification is a covalent processing event resulting from a proteolytic cleavage or from the addition of a modifying group to one amino acid. So far, more than 350 PTMs have been characterized. They modulate the function of most eukaryote proteins by altering their activity state, localization, turnover, and interactions with other proteins. Although proteins can be modified pre-, co- or post-translationally, all protein modifications are generally referred to as PTMs, because a majority of them are made post-translationally, after the protein is folded.

How many PTMs does your protein have?

It is believed that there are over 170 databases and computational tools for PTM analysis. The Cuckoo Workgroup strives to maintain up-to-date resource list: This site also has good PTM prediction tools. 




FindMod is a tool that can predict potential protein post-translational modifications (PTM) and find potential single amino acid subsitutions in peptides.

N-Terminal PTM predication


PhosphoNET - PhosphoNET is an open-access, online resource developed by Kinexus Bioinformatics Corporation to foster the study of cell signalling systems to advance biomedical research in academia and industry.

Prediction of lysine methylation and lysine acetylation: PLMLA

How Common Are PTMs? PTM Statistics Curator: Automated Curation and Population of PTM Statistics from the Swiss-Prot Knowledgebase. An automated computational method for quantifying the number of each post-translational modification reported experimentally and non-experimentally in the Swiss-Prot Knowledgebase. Non-experimental qualifiers (Potential, Probable, By Similarity) are consistent with those defined in Swiss-Prot. This method will automatically load the new PTM IDs, curate them, and quantify and even categorize by organism this information every month as the Swiss-Prot Knowledgebase is updated.

PTM Structureal Database

RESID Database of Protein Modifications a comprehensive collection of annotations and structures for protein modifications including amino-terminal, carboxyl-terminal and peptide chain cross-link post-translational modifications.

Databases for major PTMs

What happens if there is a mutation on an amino acid that is supposed to be phosphorylated? MIMP Predicting the impact of mutations on kinase-substrate phosphorylation. 

NetSurfP server predicts the surface accessibility and secondary structure of amino acids in an amino acid sequence. The method also simultaneously predicts the reliability for each prediction, in the form of a Z-score.

PTM Structure Modeling:

Arginine Methylation
Prediction Tools

MeMo: (Chen, et al., 2006)

BPB-PPMS(Shao, et al., 2009)

MASA(Shien, et al., 2009)

PLMLA: PLMLA is an in silico online tool for prediction of potential lysine methylation and lysine acetylation from protein sequences. (Shi, et al., 2012)

PMeS: PMeS is a web tool for identifying protein methylation sites based on enchanced feature encoding scheme and support vector machine. (Shi, et al., 2012)

iMethyl-PseAAC: The web-server iMethyl-PseAAC is a web server that could predict methylation sites in proteins. (Qiu, et al., 2014)

iLM-2L: A two-level predictor for identifying protein lysine methylation sites and their methylation degrees by incorporating K-gap amino acid pairs into Chou׳s general PseAAC. (Ju, et al., 2015)

GPS-MSP: In this work, we adopted GPS 3.0 algorithm and built GPS-MSP (Methyl-group Specific Predictor) for the prediction of general or type-specific methylline and methylarginine residues in proteins. (Deng, et al., 2016)

iPTM-mLys: The web-server iPTM-mLys is used to predict the identifying Lysine PTM sites and the modified type(s) in proteins. (Qiu, et al., 2016)

PSSMe: PSSMe is a tool for identifying species-specific methylation sites based on information gain (IG) feature optimization method. (Wen, et al., 2016)

MePred-RF: To predict methylation sites, we develop a machine learning based predictor called MePred-RF. (Wei, et al., 2017)

PRmePRedPRmePRed is a SVM based prediction tool to predict arginine methylation sites in proteins. (Kumar, et al., 2017)

Met-predictor: This predictor is developed to predict lysine and arginice methylation sites based on support vector machine (SVM) classifier. It is supplied in source code form along with the required data files and run under the linux. The input is a protein sequence file (fasta format). (Zheng, et al., 2020)



Protein Structure Prediction

SWISS-MODEL is a fully automated protein structure homology-modelling server, accessible via the ExPASy web server, or from the program DeepView (Swiss Pdb-Viewer). The purpose of this server is to make Protein Modelling accessible to all biochemists and molecular biologists worldwide.

Rosetta Commons information on the Rosetta software suite - algorithms for computational modeling and analysis of protein structures.

A compendium of many protein analysis tools by Robert B. Russel, Bioinformatics, Research & Development, SmithKline Beecham Pharmaceuticals:

SCOP: Structural Classification of Proteins -provides a broad survey of all known protein folds, detailed information about the close relatives of any particular protein, and a framework for future research and classification.

EvoDesign an evolutionary profile based approach to de novo protein design. Starting from a scaffold of target protein structure, EvoDesign first identifies protein families with similar folds from the PDB library by TM-align.

Protein-Protein Interactions

STRING Protein-Protein Interaction Networks.

The ConSurf Server - server for the identification of functional regions in proteins

Human Protein Atlas

Small Molecules and Drugs

Drug Report developed to collect available scientific data on marketed drugs. The information can help scientists, doctors and patients to stay tuned for latest knowledge of drugs.

RxList. Medications and prescription drug information for consumers and medical health professionals.

About Herbs. Expert advice and information on supplements, integrative medicine treatments, and more.

Lab Materials and Equipment

Science Exchange

ABRF Core Market Place

Collecting and Presenting Data


Useful Guidelines on Collecting Data from the Journal of Biological Chemistry here.