Kevin Karplus's Lab group

(Last Update: 13:18 PDT 12 May 2008)

This web page is for Kevin Karplus's group in protein-structure prediction (and, eventually, protein design). The page has not been fully created yet, so you'll probably find more interesting stuff on the members' home pages.

Spring 2008 individual meeting schedule

(Note: not on campus on Thursdays this quarter.)

time Monday Tuesday Wednesday Friday
10:00 COT(Kerr129)
10:30 COT(Kerr129)
11:00 COT(Kerr129) John Kim
11:30 COT(Kerr129) Steven Hobbs
12:00 BME 280B John Archie
12:30 BME 280B
1:00 Josue BME 280B
1:30 Josue BME 280B
2:00 Firas George ?? Grant Open office
2:30 Firas George ?? Grant Open Office
3:00 BME 281K BMEfac
3:30 BME 281K BME 281G BMEfac
4:00 BME 281K BME 281G bread and tea
4:30 Alex Atkins bread and tea
5:00 bread and tea
5:30

Spring 2008 Group Meetings

Lab meetings will be Mondays 3:00-4:30 in PSB 305 (the conference room).
date speaker topic/paper
31 March 2008 Glenn Millhauser Task classifying GPCRs
7 April 2008 Josue Samayoa Daniela Röthlisberger, Olga Khersonsky, Andrew M. Wollacott, Lin Jiang, Jason DeChancie, Jamie Betker, Jasmine L. Gallaher, Eric A. Althoff, Alexandre Zanghellini, Orly Dym, Shira Albeck, Kendall N. Houk, Dan S. Tawfik, David Baker
Kemp elimination catalysts by computational enzyme design.
Nature advance online publication 19 March 2008
doi:10.1038/nature06879
14 April 2008 John Archie Jian Qiu, Will Sheffler, David Baker, William Stafford Noble.
Ranking predicted protein structures with support vector regression.
Proteins: Structure, Function, and Bioinformatics 2007
DOI: 10.1002/prot.21809
21 April 2008 Martin Madera
Wed 23 Apr 2008 Janine Ilagan (Jurica lab) STRUCTURE CLUB Sinsheimer 123 12:30-1:40
28 April 2008 Firas Khatib practice interview
5 May 2008 Alex Atkins Johannes Söding
Protein homology detection by HMM-HMM comparison
Bioinformatics 2005 21(7):951-960;
doi:10.1093/bioinformatics/bti125
12 May 2008 John Kim Jesper Lundström, Leszek Rychlewski, Janusz Bujnicki, and Arne Elofsson.
Pcons: A neural-network-based consensus predictor that improves fold recognition
Protein Science (2001), 10:2354-2362. http://www.proteinscience.org/cgi/content/abstract/10/11/2354
Fri 16 May 2008 Grad research symposium
19 May 2008 California State Science Fair no meeting
Wed 21 May 2008 (Rubin lab) STRUCTURE CLUB PSB 240 12:30-1:40
26 May 2008 Memorial Day no meeting
2 June 2008 Steven Hobbs back propagation
Thu 5 June 2008 Undergrad poster symposium

Unscheduled papers

Old schedules

After each quarter, I move the meeting schedule to old-schedules.html, so that I have a rough record of what we talked about.

Register for class

Graduate students attending the protein structure club and lab group meetings should register for BME 281K (Seminar on Protein Structure Prediction). This is in addition to any independent study or thesis work they are doing with me.

Current projects

There are always more projects to do than people to do them. I used to keep a partial list of projects that need doing at http://www.soe.ucsc.edu/~karplus/protein-projects.html but I have not been keeping it up to date. Instead, I've been creating project lists for the BME 220 (Protein Informatics) class. The older lists may still contain viable projects that I have not brought forward to the newer lists.

Lab Members

Kevin Karplus
Professor of Biomolecular Engineering, head of lab, master programmer for undertaker, predict-2nd, estimate-dist, correlated-columns, and C++ library.

Richard Hughey
Professor of Computer Engineering, master programmer for SAM tool suite.

Dietlind Gerloff
Assistant Professor of Biomolecular Engineering, structure predictor

George Shackelford
Bioinformatics grad student, working on contact prediction in proteins. Participated in CASP7.

Grant Thiltgen
MS/PhD student in bioinformatics. Summer 2005-current. Working on testing a new H-bond-based local-structure alphabet. Participated in CASP7.

Firas Khatib
PhD student (inherited from Carol Rohl's lab). Currently writing up work done with Carol on finding knots in protein conformations generated by Rosetta. Working on generalizing knot-finding to "poke-finding" (pokes are slip knots). Participated in CASP7.

Josue Samayoa
PhD student (inherited from Carol Rohl's lab). Finished MS thesis on using residual dipolar coupling data to aid in choosing alignments of targets to templates. Working with Fitnat Yildiz on simple repeats and short proteins in Vibrio cholerae.

John Archie
Started as rotation student W07. Worked on including predicted burial, bys, and protein-block local structure alphabets into undertaker cost function. Getting into protein design.

Hyunsung John Kim
Started as a Master's student in Spring 2008, after taking BME 220 as an undergrad in Spring 2007. Made some modifications to undertaker for analyzing packing of aromatic rings as a BME 220 project; currently working on per-residue model quality assessment using undertaker.

Steven Hobbs
Started as an undergrad Winter 2008, working on trying to get Rprop implemented in predict-2nd.

Alex Atkins
Started as an undergrad in Winter 2008 in BME 220, working on trying to get HMM-HMM scoring and alignment working in SAM's hmmscore.

Dietlind's students

Thomas Juettemann
Dietlind's student from Edinburgh, visiting UCSC starting July 2007.

Research Topics

The lab mainly works on protein-structure prediction, though we hope to get into protein design in the near future. We work on almost all aspects of protein-structure prediction, including remote homology detection using iterated search and hidden Markov models, alignment of sequences to structures using multi-track hidden Markov models, prediction of local structure properties using neural nets, prediction of residue contacts using mutual information, and tertiary structure prediction using conformation generation and scoring.
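As a rough illustration of the mutual-information idea mentioned above, here is a minimal Python sketch that computes the empirical mutual information (in bits) between two columns of a multiple alignment. It is not the lab's correlated-columns code: the function name and the input format (two equal-length strings, one character per sequence) are assumptions made for this example, and it omits the sequence weighting, small-sample corrections, and null-model statistics that a real contact predictor would need.

```python
from collections import Counter
from math import log2


def column_mutual_information(col_i, col_j):
    """Empirical mutual information (bits) between two alignment columns.

    col_i and col_j are equal-length strings; position k in each string
    holds the residue that sequence k has in that column.
    (Hypothetical helper for illustration only.)
    """
    if len(col_i) != len(col_j):
        raise ValueError("columns must come from the same set of sequences")
    n = len(col_i)
    if n == 0:
        return 0.0

    pair_counts = Counter(zip(col_i, col_j))  # joint counts of (a, b)
    i_counts = Counter(col_i)                 # marginal counts for column i
    j_counts = Counter(col_j)                 # marginal counts for column j

    mi = 0.0
    for (a, b), count in pair_counts.items():
        p_ab = count / n
        # p_ab / (p_a * p_b) written with raw counts to limit rounding error
        ratio = count * n / (i_counts[a] * j_counts[b])
        mi += p_ab * log2(ratio)
    return mi


if __name__ == "__main__":
    # Two perfectly covarying columns versus two independent ones.
    print(column_mutual_information("AAGG", "AAGG"))
    print(column_mutual_information("AAGG", "AGAG"))
```

Columns that vary together score high, and columns that vary independently score near zero. With few sequences the raw score is noisy, which is why a null model for the score is needed in practice.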

We have participated in all but the first of the CASP (Critical Assessment of Structure Prediction) experiments and have done well in CASP2, CASP3, CASP4, CASP5, CASP6, and CASP7. All our working notes and files for CASP6 and CASP7 are available on the web at http://www.soe.ucsc.edu/~karplus/casp6 and http://www.soe.ucsc.edu/~karplus/casp7. A rough assessment of the CASP6 results is available at http://www.soe.ucsc.edu/~karplus/casp6/assessment.html

Publications

Publications are now on a separate page, but see also Kevin Karplus's paper list.

Software and services available

Released Software:

We currently have two software packages available:
SAM
The premier suite of hidden Markov model tools, originally created by Anders Krogh, extended and maintained by Richard Hughey.
gen_sequence
Open-source (C) code for generating random sequences; also includes code for generating random numbers from beta and Gaussian distributions, and random vectors from mixtures of Dirichlet distributions.

Unreleased Software: (pointers needed for UCSC access only)

We hoped to be able to distribute the following software in the near future, but NIH turned down a grant application to fix them up enough to make them open-source, so it is not clear when we'll have the spare resources to do it:
"ultimate" C++ class library
undertaker
predict-2nd
estimate-dist
correlated-columns

Web Services and databases

SAM-T06 alignment, HMM, database query, secondary structure predictions, residue-residue contact prediction, 3D-structure prediction
UCSC's SAM-T06 method for iterative SAM HMM construction, remote homology detection, and protein structure prediction updates SAM-T02 by using more sensitive iterated search, more local structure prediction, residue-residue contact prediction, and full 3D structure prediction. SAM-T99, SAM-T02, and SAM-T06 were all server entries to CASP7, and SAM-T06 was the best of the three.

Submit a target protein sequence in FASTA format and receive a web page full of predictions. This is our current best-performing server.

SAM-T02 alignment, HMM, database query, secondary structure predictions and more
UCSC's SAM-T02 method for iterative SAM HMM construction, remote homology detection, and protein structure prediction updates SAM-T99 by using predicted secondary structure information in its scoring functions. Both SAM-T99 and SAM-T02 were "automatic" entries to CASP5 and CASP6.

Submit a target protein sequence in FASTA format and receive SAM-T02 alignment, HMM, protein database query, secondary structure predictions, sequence logos and pairwise alignments of target to top database hits.

Yeast protein predictions
We have pre-computed SAM-T02 predictions for all the ORFs of S. cerevisiae, and created a web page similar to the results returned by the SAM-T02 server. The web pages are not currently indexed---to find a protein like YBL008W you have to go to the subdirectory YBL0/YBL008W and get the summary.html file. We update some of the yeast predictions each quarter, based on new structures released in PDB. Some of the newer updates use the SAM-T06 protocol.
SAM-T99 alignment, HMM, protein database query, and secondary structure prediction
Submit a protein sequence (or alignment) in FASTA format and receive SAM-T99 alignment, HMM, database hits, and secondary structure prediction. This site has been mostly superseded by the SAM-T02 site, but we'll keep it running as long as we can, since several meta-servers use it. We urge anyone doing new work to use the SAM-T06 server instead, since it produces more accurate results. The T99 server may go away as we move to a new cluster, as the code is very messy and hard to port.
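Since the yeast prediction pages described above are not indexed, a small helper can build the relative path for a given ORF. The following Python sketch assumes, based only on the single YBL008W example, that the subdirectory name is always the first four characters of the ORF name; the function name is hypothetical, and the base URL of the yeast predictions is deliberately left out because it is not given here.

```python
def yeast_summary_path(orf):
    """Return the relative path of the summary page for a yeast ORF.

    Assumes the layout <first four characters>/<ORF name>/summary.html,
    inferred from the example YBL008W -> YBL0/YBL008W/summary.html.
    (Hypothetical helper for illustration only.)
    """
    name = orf.strip().upper()
    if len(name) < 4:
        raise ValueError("ORF name is too short: %r" % orf)
    return "%s/%s/summary.html" % (name[:4], name)


if __name__ == "__main__":
    print(yeast_summary_path("YBL008W"))
```

Append the returned path to the base address of the yeast predictions to reach the page for that ORF.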

Lab Alumnæ and Alumni

This list is not really complete. I have only recently started keeping the list, and my memory is notoriously poor. I know I have omitted many undergrads who worked with me, and I relied on publication lists to try to get the complete list of grad students who worked with me, so anyone who didn't publish probably got overlooked. I've also limited the list to bioinformatics projects, omitting students who worked with me back when I did VLSI design and CAD for VLSI.

Martin Madera
Postdoc from May 2006 to Apr 2008. Participated in CASP7. Mainly spent his time writing grant proposals in an unsuccessful attempt to get some stable funding for himself. Revamped our fold-recognition and alignment tests.

Martin Paluszewski
Pawel Winter's student from Denmark, visited UCSC for 6 months, starting July 2007. Working on extracting constraints from alignments.

Joanna Sharman
Dietlind's student from Edinburgh, visited UCSC for a short time starting August 2007.
Pinal Kanabar
MS student. Started work on testing a new H-bond-based local-structure alphabet in Spring 2005 as a prospective student (now Grant's project). Participated part-time in CASP7.
Greg Dougherty
MS student. Working on interfacing ProteinShop and undertaker (mainly as BME 220 project, W06).
Zack Sanborn
PhD student. Lab rotation Winter 06, worked on protein design using back-propagation in neural nets and RosettaDesign. Participated part-time in CASP7.
Chris Wong
MS student. Worked on sequence recovery testing of Design1st in Predict-2nd. Participated part-time in CASP7.
Mark Mitchell
PhD student. Lab rotation Winter 06, worked on confirming that gamma distribution was appropriate for mutual information null model. (deceased)
Oscar Hur, PhD (postdoc, roughly 2002-2005)
Published paper on converting hydrogen distance restraints into heavy-atom constraints. Working on an HMM for detecting and aligning TIM barrels.
Sol Katzman
bioinformatics grad student---worked for group for CASP6 (Jan-Sept 2005), but has chosen a different lab after first-year rotations.
Wing Wong (2004-2005)
MS student in CS at San Jose State. Worked on profile-profile alignment in SAM, but decided to stick with CS.
Jes Frellsen
undergraduate from Denmark (Winter+Spring 2005). Worked on using backpropagation in local-structure neural nets to do protein design.
NavyaSwetha Davuluri
Undergrad. Working on tuning undertaker cost function for rotamer optimization. Participated in CASP7.
Erich Blume
Undergrad. Working on tuning undertaker cost function for rotamer optimization.
Sylvia Do
Undergrad. Participated in CASP7.
Crissan Harris
Undergrad. SURF-IT student. Participated in CASP7.
Cynthia Hsu
Undergrad. SURF-IT student. Participated in CASP7.
Suma Potluri
MS student in computer science. Started project in W06 on converting neural-net probability vector output for burial alphabet to continuous prediction for burial.
Ron Chao
bioinformatics grad student. Winter 2005. Was going to look at template or alignment selection for close homologs, but drifted off to other labs.
Cameron Reid
has BS in biology and CS from UCSC. Took BME 100. Spring 2005. Started working on using inverse kinematics to do "wiggling without breaking" as a conformation-change operator in undertaker, but drifted off before implementing anything.
Carl Gorringe
Undergraduate (inherited from Carol Rohl's lab). Winter 2005. Started working on a project with David Bernick to predict functional residues from differences in conservation between homologs and "morphologs", but dropped the project due to overload.
Rachel Karchin
A UCSC undergrad and grad student. Worked on CASP4 and CASP5. Finished her PhD under Karplus's supervision in June 2003. Currently working as a postdoc in Šali's lab at UCSF.
Christian Barrett
Finished his PhD under Karplus's supervision in 2001. Worked for a year in Iceland and is currently doing a postdoc in Palsson's lab at UCSD. Participated in CASP-2 and CASP-3.
Melissa Cline
jointly advised with David Haussler and Lydia Gregoret, worked with Karplus on several projects. Finished her PhD in June 2000. Participated in CASP-2 and CASP-3.
Kimmen Sjölander
jointly advised with David Haussler, but worked with Karplus on several projects. Finished her PhD in 1997. Currently on the faculty at UC Berkeley. Participated in CASP-2.
Leslie Grate
advised by Richard Hughey, did at least one project with Karplus (finding homolog of mammalian antizyme in S. pombe, but not S. cerevisiae). Finished his PhD in 1999. Currently at Lawrence Berkeley Lab, but looking for a more productive research position. Provided some software for CASP-2 and CASP-3.
Mark Diekhans
advised by David Haussler, but has provided some software support for the Karplus lab, mainly in the form of scripts for downloading and parsing PDB files. Has also worked on secondary-structure prediction using posterior decoding of HMMs.
Sugato Basu
Finished his Master's degree in 2000. Worked on the Transform class in the C++ library and some other fundamental code for the undertaker program. Now in Raymond Mooney's machine-learning group at UT Austin, working on semi-supervised clustering, both of text and microarray data, PhD expected summer 2005.
Birong Hu
Finished her master's degree in 2000. Worked on using BaliBase to test SAM-T99 as a multiple-alignment method. Had a prior PhD in biology from UCLA.
David Lin
jointly advised by Haussler. Finished his Masters in 1999, worked with Karplus on converting protein HMMs to models for searching EST databases.
Pernille Nielsen
a visiting grad student (Oct 04-Feb 05), from Anders Krogh's lab in Denmark. Working on support-vector machines and HMMs to classify short ORFs as protein-coding or non-protein coding.
Marcia Soriano
bioinformatics senior, evaluating secondary-structure predictors. Graduated March 05.
Bret Barnes, undergrad
bioinformatics senior, track weighting for multi-track HMMs to optimize alignment. Graduated 2005.
Adetunde Famakinwa Adekunle (Tunde)
Undergrad summer 2005. Worked on training local-structure neural nets.
Rudy Ortegon
Undergrad summer 2005. Worked on implementing priority queues.
Mamie Jallow
Undergrad from Univ of Washington. summer 2005. Worked on structure predictions for extracellular domains of chemotaxis proteins from Vibrio cholerae.
Don Speck
Don was never officially part of the Karplus group, but did a lot of work across the hall in the Kestrel lab and frequently interacted with Richard Hughey and Kevin Karplus. One of his projects (on reduced-space sequence alignment) was particularly important to the SAM suite. He is now working as a VLSI designer.
Spencer Tu
Worked on a general HMM program for several years. Architect of the SAM-T99 web service. Participated in CASP-4.
Jenny Draper
bioinformatics grad student, currently working in Karen Ottemann's lab. Participated in CASP-5 and CASP-6.
Jonathan Casper
computer engineering grad student. Participated in CASP-5. Did some work on scoring hydrogen bonds.
Martina Koeva
bioinformatics grad student, currently working in Josh Stuart's lab. Participated in CASP-6 predictions.
Yael Mandel-Gutfreund
Was a postdoc for Lydia Gregoret, but supervised by Karplus for a year after Gregoret left UCSC. Created the str and str2 local structure alphabets. Participated in CASP-5. Now on the faculty at the Technion Institute of Technology.



Questions about page content should be directed to

Kevin Karplus
Biomolecular Engineering
University of California, Santa Cruz
Santa Cruz, CA 95064
USA
karplus@soe.ucsc.edu
1-831-459-4250
318 Physical Sciences Building