[Biojava-dev] bits of information
Lachlan Coin
lc1 at sanger.ac.uk
Tue Jun 3 13:44:00 EDT 2003
Hi,
I guess it all depends on your intuition about what information actually
means, but sticking to the standard definitions, the low value returned by
bitsOfInformation() reflects the fact that there is not much uncertainty in
this distribution. If the distribution were 100% A, there would be no
uncertainty, and bitsOfInformation() should return 0. On the other hand,
information (or uncertainty) is maximised, at 2 bits, with 25% each of
A, C, G and T.
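
To make the two quantities in this thread concrete, here is a minimal sketch in plain Java. It deliberately does not call the BioJava API; the class name EntropySketch and the methods entropyBits and informationContentBits are hypothetical, chosen only for illustration. It assumes the weights sum to 1, takes 0 * log2(0) as 0, and computes both the Shannon entropy (the uncertainty) and the alternative Mark proposes, log2(alphabet size) minus the entropy.

```java
final class EntropySketch {

    // Base-2 logarithm via the change-of-base identity.
    static double log2(double x) {
        return Math.log(x) / Math.log(2.0);
    }

    // Shannon entropy H = -sum(p_i * log2(p_i)) in bits.
    // Terms with p_i == 0 are skipped, using the convention 0 * log2(0) = 0.
    static double entropyBits(double[] weights) {
        double h = 0.0;
        for (double p : weights) {
            if (p > 0.0) {
                h -= p * log2(p);
            }
        }
        return h;
    }

    // The quantity proposed in the quoted message:
    // log2(alphabet size) - entropy, i.e. the reduction in uncertainty
    // relative to a uniform distribution over the same alphabet.
    static double informationContentBits(double[] weights) {
        return log2(weights.length) - entropyBits(weights);
    }

    public static void main(String[] args) {
        double[] skewed = {0.97, 0.01, 0.01, 0.01}; // weights for a, c, g, t
        // Entropy is roughly 0.2419 bits, matching the value Mark reports.
        System.out.printf("entropy     = %.4f bits%n", entropyBits(skewed));
        // The proposed alternative is roughly 1.7581 bits.
        System.out.printf("information = %.4f bits%n", informationContentBits(skewed));
    }
}
```

Under the entropy definition the 97% A distribution scores low because it is nearly certain; under the second definition the same distribution scores high because it is far from uniform. Which of the two a method called bitsOfInformation() should return is exactly the naming question being discussed.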
Lachlan
On Sun, 1 Jun 2003, Schreiber, Mark wrote:
> Hi -
>
> The bitsOfInformation() method from DistributionTools seems to be returning only the average weighted entropy, not the actual information.
>
> E.g. for a distribution made thus:
>
> //set the weight of a to 0.97
> dist.setWeight(DNATools.a(), 0.97);
> //set the others to 0.01
> dist.setWeight(DNATools.c(), 0.01);
> dist.setWeight(DNATools.g(), 0.01);
> dist.setWeight(DNATools.t(), 0.01);
>
> The bits of information is calculated to be: 0.24194073285321088 bits
>
> This strikes me as a bit low (excuse the pun). Possibly there should be a method called totalEntropy and bits of information should return log2(alpha size) - totalEntropy.
>
> - Mark
-------------------------------------------------------------
Lachlan Coin
Wellcome Trust Sanger Institute      Magdalene College
Cambridge CB10 1SA                   Cambridge CB30AG
Ph: +44 1223 494 820
Fax: +44 1223 494 919
------------------------------------------------------------