[DAS2] tiled queries for performance

Brian Gilman gilmanb at pantherinformatics.com
Mon Nov 21 21:46:25 UTC 2005


Hello Everyone,

	I've been lurking on the list and wanted to say hi.

	We're looking into this kind of implementation issue ourselves and  
thought that a bitorrent like cache makes the most sense. ie. all  
servers in the "fabric" are issued the query in a certain "hop  
adjacency". These servers then send their data to the client who's  
job it is to assemble the data.

								HTH,

										-B
--
Brian Gilman
President Panther Informatics Inc.
E-Mail: gilmanb at pantherinformatics.com
         gilmanb at jforge.net
AIM: gilmanb1

01000010 01101001 01101111
01001001 01101110 01100110
01101111 01110010 01101101
01100001 01110100 01101001
01100011 01101001 01100001
01101110



On Nov 21, 2005, at 3:47 PM, Allen Day wrote:

> Hi,
>
> I had an idea of how clients may be able to get better response from
> servers by using a tiled query technique.  Here's the basic idea:
>
> ClientA wants features in chr1/1010:2020, and issues a request for  
> that
> range.  No other clients have previously requested this range, so the
> server-side cache faults to the DAS/2 service (slow).
>
> ClientB wants features in chr1/1020:2030, and issues a request for  
> that
> range.  Although the intersection of the resulting records with  
> ClientA's
> query is large, the URIs are different and the server-side cache  
> faults
> again.
>
> If ClientA and ClientB were to each issue two separate "tiled"  
> requests:
>
>  1. chr1/1001:2000
>  2. chr1/2001:3000
>
> ClientB could take advantage of the fact that ClientA had been  
> looking at
> the same tiles.
>
> For this to work, the clients would need to be using the same tile  
> size.
> The optimal tile size is likely to vary from datasource to datasource,
> depending on the length and density distributions of the features
> contained in the datasource.  The "sources" or "versioned sources"
> payload could suggest a tiling size to prospective clients.   
> Servers could
> also pre-cache all tiles by hitting each tile after an update of the
> datasource (or the DAS/2 service code).
>
> The tradeoff for the performance gains is that clients may now need  
> to do
> filtering on the returned records to only return those requested by  
> the
> client's client.
>
> -Allen
> _______________________________________________
> DAS2 mailing list
> DAS2 at portal.open-bio.org
> http://portal.open-bio.org/mailman/listinfo/das2




More information about the DAS2 mailing list