1
$Id: Changes,v 1.39 2003/12/22 20:00:34 heikki Exp $
1
$Id: Changes,v 1.49.4.1 2006/10/02 23:10:11 sendu Exp $
3
3
Revision history for Bioperl core modules
9
1.5 series - these releases represent the pre-1.6 releases, the 1.6 series
10
will be branches from the last 1.5.x release.
12
1.5.2 Developer release
13
Full details of changes since 1.5.1 are available online at:
14
http://www.bioperl.org/wiki/Change_log
15
The following represents a brief overview of the most important changes.
18
- Overhaul. Brand new system fully allows markers to have multiple
19
positions on multiple maps, and to have relative positions. Should be
23
- This module and all the modules in the Taxonomy directory now
24
deprecated in favour of Bio::Taxon and Bio::Tree::Tree
29
* get_Taxonomy_Node() eventually to be deprecated, renamed get_taxon().
31
* New methods ancestor(), each_Descendent() and _handle_internal_id().
33
* Allows for different database modules to create Bio::Taxon objects
34
with the same internal id when the same taxon is requested from each.
37
* get_Children_Taxids() is deprecated, superceded by each_Descendent().
39
* No longer includes the fake root node 'root'; there are multiple roots
40
now (10239, 12884, 12908, 29384 and 131567). Consistent with entrez.pm
43
* get_node() has new option -full
45
* Caches data retrieved from website
48
- Now a Bio::Taxon. Carries out the species name -> specific name munging
49
that Bio::DB::Taxonomy modules and SeqIO modules used to do, for
50
backward compatability in species() method.
52
o Bio::Search and Bio::SearchIO
53
- Overhaul. The existing system has been sped up via some minor changes
54
(mostly gain-of-function to the API). Bio::PullParserI is introduced
55
as a potential eventual replacment for the existing system, though as
56
yet only a Hmmpfam parser exists written using it.
59
1.5.1 Developer release
61
o Major problem with how Annotations were written out with
62
Bio::Seq is fixed by reverting to old behavior for
63
Bio::Annotation objects.
68
* bug #1871; REFLOOP' parsing loop, I changed the pattern to
69
expect at l east 9 spaces at the beginning of a line to
70
indicate line wrapping.
72
* Treat multi-line SOURCE sections correctly, this defect broke
73
both common_name() and classification()
75
* parse swissprot fields in genpept file
77
* parse WGS genbank records
80
* Changed regexp for ID line. The capturing parentheses are
81
the same, the difference is an optional repeated-not-semi-
82
colon expression following the captured \S+. This means the
83
regexp works when the division looks like /PRO;/ or when the
84
division looks like /ANG ;/ - the latter is from EMBL
87
* fix ID line parsing: the molecule string can have spaces in
88
it. Like: "genomic DNA"
90
- swiss.pm: bugs #1727, #1734
93
* Added parser for entrezgene ASN1 (text format) files.
94
Uses Bio::ASN1::EntrezGene as a low level parser (get it from CPAN)
98
- maf.pm coordinate problem fixed
100
o Bio::Taxonomy and Bio::DB::Taxonomy
102
- Parse NCBI XML now so that nearly all the taxonomy up-and-down
103
can be done via Web without downloading all the sequence.
105
o Bio::Tools::Run::RemoteBlast supports more options and complies
106
to changes to the NCBI interface. It is reccomended that you
107
retrieve the data in XML instead of plain-text BLAST report to
108
insure proper parsing and retrieval of all information as NCBI
109
fully expects to change things in the future.
111
o Bio::Tree and Bio::TreeIO
113
- Fixes so that re-rooting a tree works properly
115
- Writing out nhx format from a newick/nexus file will properly output
116
bootstrap information. The use must move the internal node labels over
118
for my $node ( grep { ! $_->is_Leaf } $tree->get_nodes ) {
119
$node->bootstrap($node->id);
122
- Nexus parsing is much more flexible now, does not care about
125
- Cladogram drawing module in Bio::Tree::Draw
127
- Node height and depth now properly calculated
129
- fix tree pruning algorithm so that node with 1 child gets merged
131
o Graphics tweaks. Glyph::xyplot improved. Many other small-medium sized
132
bugs and improvements were added, see Gbrowse mailing list for most of
135
o Bio::DB::GFF partially supports GFF3. See information about
136
gff3_munge flag in scripts/Bio-DB-GFF/bulk_load_gff.pl.
138
o Better location parsing in Bio::Factory::FTLocationFactory -
139
this is part of the engine for parsing EMBL/GenBank feature table
140
locations. Nested join/order-by/complement are allowed now
142
o Bio::PrimarySeqI->translate now takes named parameters
144
o Bio::Tools::Phylo::PAML - parsing RST (ancestral sequence
145
reconstruction) is now supported. Parsing different models and
146
branch specific parametes are now supported.
148
o Bio::Factory::FTLocationFactory - parse hierarchical locations
151
o Bio::Matrix::DistanceMatrix returns arrayrefs instead of arrays
152
for getter/setter functions
156
- blast bug #1739; match scientific notation in score
157
and possible e+ values
159
- blast.pm reads more WU-BLAST parameters and parameters, match
160
a full database pathname,
162
- Handle NCBI WEB and newer BLAST formats specifically
163
(Query|Sbjct:) match in alignment blocks can now be (Query|Sbjct).
165
- psl off-by-one error fixed
167
- exonerate parsing much improved, CIGAR and VULGAR can be parsed
168
and HSPs can be constructed from them.
170
- HSPs query/hit now have a seqdesc field filled out (this was
171
always available via $hit->description and
172
$result->query_description
174
- hmmer.pm can parse -A0 hmmpfam files
176
- Writer::GbrowseGFF more customizeable.
178
o Bio::Tools::Hmmpfam
179
make e-value default score displayed in gff, rather than raw score
180
allow parse of multiple records
183
1.5 Developer release
185
o Bio::Align::DNAStatistics and Bio::Align::ProteinStatistics
186
provide Jukes-Cantor and Kimura pairwise distance methods,
189
o Bio::AlignIO support for "po" format of POA, and "maf";
190
Bio::AlignIO::largemultifasta is a new alternative to
191
Bio::AlignIO::fasta for temporary file-based manipulation of
192
particularly large multiple sequence alignments.
194
o Bio::Assembly::Singlet allows orphan, unassembled sequences to
195
be treated similarly as an assembled contig.
197
o Bio::CodonUsage provides new rare_codon() and probable_codons()
198
methods for identifying particular codons that encode a given
201
o Bio::Coordinate::Utils provides new from_align() method to build
202
a Bio::Coordinate pair directly from a
203
Bio::Align::AlignI-conforming object.
205
o Bio::DB::Biblio::eutils is a class for querying NCBI's Eutils.
206
Send a Pubmed, Pubmed Central, Entrez, or other query to NCBI's
207
web service using standard Pubmed query syntax, and retrieve
210
o Bio::DB::GFF has various sundry bug fixes.
212
o Bio::FeatureIO is a new SeqIO-style subsystem for
213
writing/reading genomic features to/from files. I/O classes
214
exist for BED, GTF (aka GFF v2.5), and GFF v3. Bio::FeatureIO
215
classes only read/write Bio::SeqFeature::Annotated objects.
216
Notably, the GFF v3 class requires features to be typed into the
219
o Bio::Graph namespace contains new modules for manipulation and
220
analysis of protein interaction graphs.
222
o Bio::Graphics has many bug fixes and shiny new glyphs.
224
o Bio::Index::Hmmer and Bio::Index::Qual provide multiple-file
225
indexing for HMMER reports and FASTA qual files, respectively.
227
o Bio::Map::Clone, Bio::Map::Contig, and Bio::Map::FPCMarker are
228
new objects that can be placed within a Bio::Map::MapI-compliant
229
genetic/physical map; Bio::Map::Physical provides a new physical
230
map type; Bio::MapIO::fpc provides finger-printed clone mapping
233
o Bio::Matrix::PSM provide new support for postion-specific
234
(scoring) matrices (e.g. profiles, or "possums").
236
o Bio::Ontology::Ontology and Bio::Ontology::Term objects can now
237
be instantiated without explicitly using Bio::OntologyIO. This
238
is possible through changes to Bio::Ontology::OntologyStore to
239
download ontology files from the web as necessary. Locations of
240
ontology files are hard-coded into
241
Bio::Ontology::DocumentRegistry.
243
o Bio::PopGen includes many new methods and data types for
244
population genetics analyses.
246
o New constructor to Bio::Range, unions(). Given a list of
247
ranges, returns another list of "flattened" ranges --
248
overlapping ranges are merged into a single range with the
249
mininum and maximum coordinates of the entire overlapping group.
251
o Bio::Root::IO now supports -url, in addition to -file and -fh.
252
The new -url argument allows one to specify the network address
253
of a file for input. -url currently only works for GET
254
requests, and thus is read-only.
256
o Bio::SearchIO::hmmer now returns individual Hit objects for each
257
domain alignment (thus containing only one HSP); previously
258
separate alignments would be merged into one hit if the domain
259
involved in the alignments was the same, but this only worked
260
when the repeated domain occured without interruption by any
261
other domain, leading to a confusing mixture of Hit and HSP
264
o Bio::Search::Result::ResultI-compliant report objects now
265
implement the "get_statistics" method to access
266
Bio::Search::StatisticsI objects that encapsulate any
267
statistical parameters associated with the search (e.g. Karlin's
268
lambda for BLAST/FASTA).
270
o Bio::Seq::LargeLocatableSeq combines the functionality already
271
found in Bio::Seq::LargeSeq and Bio::LocatableSeq.
273
o Bio::SeqFeature::Annotated is a replacement for
274
Bio::SeqFeature::Generic. It breaks compliance with the
275
Bio::SeqFeatureI interface because the author was sick of
276
dealing with untyped annotation tags. All
277
Bio::SeqFeature::Annotated annotations are Bio::AnnotationI
278
compliant, and accessible through Bio::Annotation::Collection.
280
o Bio::SeqFeature::Primer implements a Tm() method for primer
281
melting point predictions.
283
o Bio::SeqIO now supports AGAVE, BSML (via SAX), CHAOS-XML,
284
InterProScan-XML, TIGR-XML, and NCBI TinySeq formats.
286
o Bio::Taxonomy::Node now implements the methods necessary for
287
Bio::Species interoperability.
289
o Bio::Tools::CodonTable has new reverse_translate_all() and
290
make_iupac_string() methods.
292
o Bio::Tools::dpAlign now provides sequence profile alignments.
294
o Bio::Tools::GFF now parses GFF version 2.5 (a.k.a. GTF).
296
o Bio::Tools::Fgenesh, Bio::Tools::tRNAscanSE are new report
299
o Bio::Tools::SiRNA includes two new rulesets (Saigo and Tuschl)
300
for designing small inhibitory RNA.
302
o Bio::Tree::DistanceFactory provides NJ and UPGMA tree-building
303
methods based on a distance matrix.
305
o Bio::Tree::Statistics provides an assess_bootstrap() method to
306
calculate bootstrap support values on a guide tree topology,
307
based on provided bootstrap tree topologies.
309
o Bio::TreeIO now supports the Pagel (PAG) tree format.
315
o Improvements to Bio::AlignIO::nexus for parsing TreeBase nexus files
317
o Bio::Graphics will work with gd1 or gd2
320
- hmmer.pm Better hmmpfam parsing, fix bug for small number of alignment outputs
322
- blast.pm Parse multi-line query fields properly
323
- small speed improvements to blasttable.pm and others
325
o Bio::DB::Taxonomy has better support for hierarchy traversal so that
326
Bio::Taxonomy::Node can be as simple as Bio::Species object while still
327
supporting more complex queries
8
330
1.4. Stable major release
10
332
Since initial 1.2.0, 3000 separate changes have been made to make this release.