Difference between revisions of "Protein modelling"
From Opengenome.net
| Line 13: | Line 13: | ||
* Align the amino acid sequence of the target protein with that(those) of the template protein(s) (program: CLUSTALW, output: PIR format) | * Align the amino acid sequence of the target protein with that(those) of the template protein(s) (program: CLUSTALW, output: PIR format) | ||
* Build 3D structural models based on the multiple sequence alignment and the template 3D structure(s) (program: MODELLER, input: ATM, ALI, and TOP files) | * Build 3D structural models based on the multiple sequence alignment and the template 3D structure(s) (program: MODELLER, input: ATM, ALI, and TOP files) | ||
| − | manual build | + | ** manual build |
| − | manually convert file formats from PDB to ATM and PIR to ALI, respectively | + | *** manually convert file formats from PDB to ATM and PIR to ALI, respectively |
| − | write a TOP script file | + | *** write a TOP script file |
| − | run MODELLER | + | *** run MODELLER |
| − | automatic build (some PDB and PIR files are not successfully converted to correct ATM and ALI files due to amino-acid sequence mismatch) | + | ** automatic build (some PDB and PIR files are not successfully converted to correct ATM and ALI files due to amino-acid sequence mismatch) |
| − | perl script to generate ATM, ALI, and TOP files from PDB and PIR files: [[perl script for ATM|txt]], gz | + | *** perl script to generate ATM, ALI, and TOP files from PDB and PIR files: [[perl script for ATM|txt]], gz |
| − | + | *** command-line arguments in order: base_directory_path pir_file_name target_protein_id(exactly 4 letters) one_or_more_PDB_file_names(exactly 4-letter prefix) | |
| − | command-line arguments in order: base_directory_path pir_file_name target_protein_id(exactly 4 letters) one_or_more_PDB_file_names(exactly 4-letter prefix) | + | *** run the perl script (e.g. ../bin/makeModellerInput.pl /home/user/Protein3DModelling/KCIP KCIP.pir KCIP 1QJA.pdb) |
| − | run the perl script (e.g. ../bin/makeModellerInput.pl /home/user/Protein3DModelling/KCIP KCIP.pir KCIP 1QJA.pdb) | + | *** run MODELLER (e.g. mod7v7 KCIP.top) |
| − | run MODELLER (e.g. mod7v7 KCIP.top) | + | ** example |
| − | example | + | *** ATM file: 5fd1.atm |
| − | ATM file: 5fd1.atm | + | *** ALI file: alignment.ali |
| − | ALI file: alignment.ali | + | *** TOP file: model-default.top |
| − | TOP file: model-default.top | + | *** command: mod7v7 model-default.top |
| − | command: mod7v7 model-default.top | + | *** output PDB file (predicted model): 1fdx.B99990001.pdb |
| − | output PDB file (predicted model): 1fdx.B99990001.pdb | + | ** MODELLER manual |
| − | MODELLER manual | + | *** PDF, HTML |
| − | PDF, HTML | + | * Display and compare the 3D structures of the model and the template proteins (program: RasMol) |
| − | Display and compare the 3D structures of the model and the template proteins (program: RasMol) | ||
Revision as of 09:35, 26 August 2005
http://biome.ngic.re.kr/ProteinModelling/
Problem (문제정의)
- Build 3D structural models of given or interested proteins
- Initial input: amino acid sequence of the target protein
- Final output: its predicted 3D structure
Method
- Find one or more structural templates of the target protein via sequence homology search against known structures (program: BLASTP or PSI-BLAST, database: PDB)
- Download template structures in PDB format (web server: RCSB PDB)
- Align the amino acid sequence of the target protein with that(those) of the template protein(s) (program: CLUSTALW, output: PIR format)
- Build 3D structural models based on the multiple sequence alignment and the template 3D structure(s) (program: MODELLER, input: ATM, ALI, and TOP files)
- manual build
- manually convert file formats from PDB to ATM and PIR to ALI, respectively
- write a TOP script file
- run MODELLER
- automatic build (some PDB and PIR files are not successfully converted to correct ATM and ALI files due to amino-acid sequence mismatch)
- perl script to generate ATM, ALI, and TOP files from PDB and PIR files: txt, gz
- command-line arguments in order: base_directory_path pir_file_name target_protein_id(exactly 4 letters) one_or_more_PDB_file_names(exactly 4-letter prefix)
- run the perl script (e.g. ../bin/makeModellerInput.pl /home/user/Protein3DModelling/KCIP KCIP.pir KCIP 1QJA.pdb)
- run MODELLER (e.g. mod7v7 KCIP.top)
- example
- ATM file: 5fd1.atm
- ALI file: alignment.ali
- TOP file: model-default.top
- command: mod7v7 model-default.top
- output PDB file (predicted model): 1fdx.B99990001.pdb
- MODELLER manual
- PDF, HTML
- manual build
- Display and compare the 3D structures of the model and the template proteins (program: RasMol)