|
| BioE131/231
Introduction to Computational Biology.
Course info
- Code: Bioe131/231
- Title: Introduction to computational biology
- Instructor: Ian Holmes (follow the link to my page for my office hours)
- When:
- Lecture log
Calendar & announcements
Lectures
Lecture slides are now posted on the BioE131 lecture notes page.
Exercises
The following self-directed exercises may be used to assess knowledge of examinable material, ahead of midterms.
These exercises are not for credit, except where otherwise stated.
See also the BioE131 fact sheet.
- Grading scheme
- 40% homework assignments
- 25% midterm exams
- 35% final project
- Extensions/Alternate Exam Dates
- Requests for extensions on homework due dates must be submitted to Ian Holmes via email at least 2 days before homework due date, clearly stating reason(s) for request.
- Please plan to attend the scheduled exam/presentation times. Requests for alternate exam/presentation dates must be submitted to Ian Holmes via email as soon as possible, clearly stating reason(s) for request. An alternate date request is only likely to be granted under extreme circumstances (family emergencies, major illness, etc.)
- Conflicting commitments on other courses will not usually be considered adequate reason to moving exams or homework deadlines unless the conflict is experienced by a majority of the class.
- Every year several requests failing to meet the above criteria are declined -- please do not invite mutual disillusion by asking for an exception.
- Teamwork
- We want to encourage you to work together, so for regular homework assignments, you may submit jointly with up to one other student as long as you identify who it was you worked with. However, we also want to encourage mixing, so you can contribute no more than three homework assignments with any one partner. After that you will have to rotate with someone else.
Student wiki
Syllabus
Approximate sequence of lectures (see also BioE131 weekly schedule):
- Introductory case study
- Overview of syllabus; available means of assessment..
- group & individual presentations; literature reviews; class participation; homework; exam(s); project
- Review of fundamental molecular biology
- Biophysical principles of RNA and protein folding
- Overview of biological databases
- Introduction to Unix
- Introduction to Perl programming: loops, variables, subroutines; file manipulation; data structures
- Assemblers, compilers, interpreters & virtual machines: machine code, C, Perl and Java
- Biophysics of synthetic biology: RNA folding kinetics & viral genome design
- Sequence alignment algorithms: Needleman-Wunsch, Smith-Waterman, Gotoh, BLAST
- Genome annotation; biological ontologies, pathway databases
- Probabilistic inference; Bayes' theorem; experimental error; expectation and variance
- Quick refresher in basic distributional analysis...
- Basic combinatorics; binomials and multinomials
- Geometric, exponential, Poisson distributions
- Gaussian distribution; mixture distributions
- Quantitative measures of information; illustration via data compression
- Log-likelihood ratios and substitution matrices; coding & cryptography
- Probabilistic models for sequence motifs; "sequence logos"
- Algorithmic complexity & "big-O" notation: examples from compbio
- Finite state machines; multiple alignment; phylogenetic reconstruction
- Rate variation, evolutionary trace and phylogenetic profiling; applications to design
- Structural biology, protein structure prediction, protein design
- Clustering algorithms: K-means, K-medians; application to microarray data analysis
- Sequence assembly & metagenomics; examples (human microbiome; bioenergy)
- Computational biology at Berkeley
Lab practicals
- Unix
- Using Wiki
- Biological Databases
- Perl Basics
- Perl Hashes & Arrays
- Perl Pattern Matching
- RNA folding
- Sequence Alignment
- Information Content of DNA
- Bacterial Gene Prediction
- Primate Phylogeny
- Pathway Mining
- Protein Visualization
- Catch-up lab; project work
Homework exercises
Homework exercises will be assigned in labs and posted on the individual lab pages.
Programming assignments will be graded both for form (style) and function (correctness). Stylistic expectations will be outlined on the style guidelines page by the time the first assignment is given.
Textbooks
No textbook purchase is required to take the class.
References to the following textbook (which can be freely downloaded) appear occasionally as recommended reading:
- The MacKay Book: MacKay, DJC. Information Theory, Inference and Learning Algorithms. ISBN:0521642981
- Can be downloaded from here (copyright grants permission to view but not print)
The following Perl books from O'Reilly may be a useful supplement to what's taught in class:
- The following are the two "classic" Perl tutorial and reference books:
- The "Perl for Bioinformatics" series have more bioinformatics-oriented examples:
Other resources
Dates
Final project dates
Final project
TBD
Labs, homeworks and holidays
|