[Biojava-l] Re: CORBADEV: LSR news - new RFP about biomolecular entities

Philip Lijnzaad lijnzaad@ebi.ac.uk
Tue, 19 Sep 2000 11:42:44 +0100 (BST)


[ Dear biocorba, biojava and bioxml folks, this mail replies to a request for
  additional CORBA data types to be standardized by the OMG Life Sciences
  Research group. That is, we are trying to establish _which_ biological
  entities related to sequence analysis need to be standardized, not _how_
  (that's the next part). Please feel free to respond, and my apologies if
  you're not interested. Philip Lijnzaad ] 

On Tue, 19 Sep 2000 10:47:57 +0100 (BST), 
"Martin" == Martin Senger <senger@ebi.ac.uk> writes:

Martin> The OMG LSR prepared a draft for a new RFP dealing with additional
Martin> biomolecular sequence analysis entities - those who were not covered
Martin> by the origibal BSA sumbissions. The RFP will be issued this
Martin> December, and the initial responses are due to the end of March
Martin> 2001. The draft is (or shorly will be) available as the OMG document
Martin> lifesci/00-09-10 (or it is available on my desk in room A-235).

Martin> The objective of the RFP is to define entities which can be used and
Martin> produced by sequence analysis as defined in BSA spec. The RFP asks
Martin> for the following entities:

Martin>    Biomolecular sequence alphabet
Martin>    Fuzzy locations
Martin>    Weight matrices
Martin>    Patterns, including profiles and HMMs
Martin>    Phylogenetic trees
Martin>    Assembly (including trace and quality data)
Martin>    Composite annotations (which may be the first step to define a gene)
Martin>    Gene (still under discussion if this is appropriate to put here)
Martin>    Taxonomy
Martin>    GeneticCode extensions, including initiators and terminators
Martin>    BioSequence extensions, inclusing sequence alphabet

Martin> The LSR is prepared to accept also submissions dealing only with some
Martin> of the listed entities. Later the various submissions will merge
Martin> together to cover most of (and ideally all) entities.

Martin> At the moment, it is a good time to ask for other entities to be
Martin> added into the RFP. 

I have just two suggestions for addition:

- a type for representing floating point information on sequence positions,
  such as hydrophobicity plots. I am not going to insist on this, because the
  (nearly) obvious solution is to just use a IDL::float. Or was that double ...

- a type for representing things like dotplots; this is fairly common in
  sequence analysis, and requesting it might just avoid the quagmire of
  different plot and/or image formats (vector based or bit based or even
  plain matrices of grey-(or colour? which colour model?) values). But this
  maybe out of scope, Architecture Board-wise. 

Furthermore, a few comments:

- I think we should put Gene in, but only if the RfP states its own
  definition of what a Gene is, and then ask to provide IDL for this _OR_ to
  clearly state what _they_ think a Gene should be. Just requesting
  "something for representing genes" hasn't worked, so maybe the RfP should
  take a shot at it so submitters can follow up.  This is not meant to shy
  away dissenting submitters, but to prod the LSR, the RfP authors and the
  submitters into something workable.

- Taxonomy and Phylogeny should prolly be on one line.

Hope this helps,

                                                                      Philip

-- 
When C++ is your hammer, everything looks like a thumb. (Steven Haflich)
-----------------------------------------------------------------------------
Philip Lijnzaad, lijnzaad@ebi.ac.uk \ European Bioinformatics Institute,rm A2-24
+44 (0)1223 49 4639                 / Wellcome Trust Genome Campus, Hinxton
+44 (0)1223 49 4468 (fax)           \ Cambridgeshire CB10 1SD,  GREAT BRITAIN
PGP fingerprint: E1 03 BF 80 94 61 B6 FC  50 3D 1F 64 40 75 FB 53