click on the Biowiki logo to go to homepage

Research | Teaching Blog
Biowiki > Blog > SummerOfCode

Search

Advanced search...

Topics

Last 2 entries
Last 4 entries
Last 8 entries
Last 16 entries
Last 32 entries
Last 64 entries
Last 128 entries
Last 256 entries



Links
misc bio links
Open bio projects
Bio wikis, blogs
Meetings, journals
off-site links
Rfam / Pfam
GMOD/GBrowse
UCSC, Ensembl
Bioperl wiki
AAA fly wiki
ISCB; RNA society


journals
Nature, Science
Cell, PLoS Biol
Nature Genetics
Genome Research
Genome Biology
PLoS Comp Bio
NAR, Mol Biol Evol
J Comp Biol
BMC Bioinformatics
Bioinformatics
Phys Rev Lett


bioinformatics blogs
Life Scientists room
Neil Saunders
Roland Krause
Pedro Beltrao
Jason Stajich
Euan Adie
Lars Jensen
Deepak Singh
Cyrus Harmon
fungalgenomes.org
nodalpoint.org
Yokofakun
Propeller Twist
bioinformatics.org
Andrew Dalke
Personal Genome
iSpecies blog
Inforbiomatica
hublog
Hocus Pokus


bio/science blogs
Biocurious
Declan Butler
Nascent
Pharyngula
Bioethics.net
Free Association
young female scientist
OpenScience project
genetics+health
evolgen
post-genomics
gene expression
Informaticopia
Tangled Bank
Cavalcade of Mammals
Triplepoint
panda's thumb
stranger fruit
Genetic Chaos
In the Pipeline


tech/scifi blogs
xkcd
Bruce Schneier
Raph Koster
io9
27B/6
William Gibson
Bruce Sterling
Rudy Rucker
The Register
The Scientist
jay is games
geekdad
videogameworkout
Red Orbit
+ve tech journal
Laughing Meme
Respectful Insolence
Braintickle


collaborators
AlexBateman
CaseyBergman
EwanBirney
RachelBrem
StevenBrenner
MichaelBrent
BillBruno
JamieCate
SueCelniker
JenniferDoudna
RichardDurbin
SeanEddy
MikeEisen
NickGoldman
RichardGoldstein
DavidHaussler
AndreasHeger
JotunHein
SteveHolbrook
AndrewHolmes
JohnHuelsenbeck
MarcinJoachimiak
SamGriffithsJones
SuziLewis
GertonLunter
IstvanMiklos
VladimirMinin
ChrisMungall
JanetNewman
RasmusNielsen
LiorPachter
BenedictPaten
TomPeat
ChrisPonting
DavidSchaffer
KimmenSjolander
GuySlater
JasonStajich
LincolnStein
MarcSuchard


PageRank Checker

Google Summer Of Code 2007: project submissions due March 26th

Our Entries: Ajax Phylo-Informatics, Automated Pipeline Tools

NESCENT has a page up about Google's 2007 summer of code.

Here's my contribution so far (excerpted from the NESCENT phyloinformatics wiki):

Evolve Unix phyloinformatics tools into Ajax applications

Rationale

Many powerful new tools for phylogenetic stochastic grammar analysis of multiple alignments, such as xrate or PHAST, as well as PAML etc, are available only from the Unix command line. These tools need to become operable over the web, especially via Javascript platforms such as the new Google Maps-like interface to GBrowse.

Approach

Use toolkits such as dojo to build asynchronous javascript wrappers for Unix tools (probabilistic modeling & phlogeny tools, format conversion utilities, sequence analysis & alignment software, genome annotation pipelines, grids & job queues, realtime parallelizable systems); other Javascript/web components (alignment viewers, tree viewers & navigators, genome browsers); and bioinformatics "mashups". Interface with gmod-ajax, Amigo and other web-based bioinformatics platforms.

Challenges

Adapting command-line tools to for web use; creating an asynchronous user interface; developing infrastructure for mashable bioinformatics...

Involved toolkits or projects

BioPerl/Biopython/Bioruby; SWIG; dojo; Sun Grid Engine; Erlang

Mentors

IanHolmes, MitchSkinner, ChrisMungall, JasonStajich

Ideas pages

AjaxPhyloinformatics; see also WishList, RnaAlignmentViewer

Extending the "make" paradigm for bioinformatics annotation pipelines

Rationale

Annotating a genome, or performing other large-scale bioinformatics analyses, typically involves a series of operations with sequential dependencies but also strong parallelism. The GNU make program is one robust approach that is often used to build such analysis pipelines, but suffers serious drawbacks for bioinformatics (e.g. no built-in database access; extremely limited pattern-matching; language is not extensible; dependencies are triggered only by file timestamps and not e.g. MD5 hash indicating file contents have changed).

Approach

The project will involve building a replacement or upgrade to "make". One possible approach will be to use a declarative language with (i) strong support for distributed processing, (ii) easy-to-use Unix "hooks" (c.f. make), (iii) database and filesystem access. Examples of candidate languages include Erlang and Termite Scheme. Alternatively, C-inclined students may start with an existing parallel "make" clone, such as qmake or distmake.

Challenges

The first challenge is to get something that is as convenient to use as "make" for migrating throwaway command-lines and analysis scripts into robust pipeline stages. Subsequent challenges will include database access, flexible pattern-matching and enhanced dependency triggers.

Involved toolkits or projects

Erlang, Termite Scheme, distmake/qmake, or other.

Mentors

IanHolmes, ChrisMungall

Ideas pages

BioMake, ErlangLanguage

Some links from Google:

-- IanHolmes - 05 Mar 2007

Actions: Edit | Attach | New | Ref-By | Printable view | Raw view | Normal view | See diffs | Help | More...