Amino acid sequence
Last updated
Last updated
An amino acid (as illustrated in Figure) is an organic molecule that is made up of a basic amino group (−NH2), an acidic carboxyl group (−COOH), and an organic R group (or side chain) that is unique to each amino acid. The order of amino acids as they occur in a polypeptide chain, and It is of fundamental importance in determining protein conformation.
A multiple sequence alignment (MSA), as illustrated in Figure 2, is a sequence alignment of three or more biological sequences, generally protein, DNA, or RNA. Multiple sequence alignments can be useful in many circumstances, e.g., detecting historical and familial relations between sequences of proteins or amino acids and determining certain structures or locations on sequences.
Before one starts to launch Leri, a multiple sequence alignment (MSA) is required. Gernerally, Leri does calculations from two sources of MSAs, one is from HMMER software, and the other one is from HHblits tool. Firstly, locally install HMMER software and download UniRef database here. The MSA can be also obtained from the HMMER web-server if you don't want to locally install the software suite.
Run jackhmmer (HMMER) to prepare the MSA by search against the Uniref** database. A simple script that illustrates how to apply the jackhmmer to search a query protein sequence is show as follows,
Run hhblits with default parameters at an E-value threshold 0.001 as shown,
When you get the multiple alignment of hits from the jackhmmer and hhblits, it is time to launch the Leri to convert the file to the standard FASTA format. The command line that converts *.sto file to FASTA file is presented here,
Trim aligned sequences according to the query sequence.
Basic statistics on the aligned sequences,