-- %TEACHINGWEB%.TahouraSamad - 08 Oct 2011
Homework 3
My program for this assigment is attached as simulator.pl.txt, and the FASTA files I tested it on are attached as testsequence1.txt and testsequence2.txt. Given composition parameters, it can load parameters and generate random sequences for 1 or more sequences in a FASTA file. Given a FASTA file with 1 or more sequences, it can generate random FASTA files with the same composition of the original sequences in the FASTA file.
I've also included samplefile.txt, a sample txt file that I load parameters from and that my program saves parameters to. The file contains the name, the total length, number of each nucleotide in the sequence as well as the percentage of each nucleotide in the sequence for each sequence in a fasta file. All lines are preceded by a > except for the name, which is preceded by >> and the length and number of each basepair, which are preceded by nothing. This makes it easy for the program to find the number of each nucleotide and total length and generate a random FASTA file from them. The percentages are included because I thought that was a more useful representation of the composition than the indiviudal numbers of each nucleotide.

Copyright © 2008-2013 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback