Changes between Version 2 and Version 3 of ResearchProjects


Ignore:
Timestamp:
Sep 24, 2009, 9:39:37 AM (15 years ago)
Author:
davea
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • ResearchProjects

    v2 v3  
    55If you're interested, please contact
    66[ProjectPeople David Anderson].
     7
     8== Data-intensive volunteer computing ==
     9
     10Currently, most BOINC projects work as follows:
     11 * Data are stored on the server
     12 * Pieces of data (input files) are sent to client, and jobs are run against them.
     13   When done, the files are deleted from the client.
     14 * Output files are sent back to the server.
     15
     16This architecture doesn't scale well for data-intensive computing.
     17There are various alternatives:
     18
     19 * Workflows: DAGs of tasks connected by intermediate temporary files.
     20   Schedule them so that temp files remain local to client most of the time.
     21 * Stream computing: e.g., IBM Infosphere
     22 * Models that involve computing against a large static dataset:
     23   e.g. !MapReduce, or Amazon's scheme in which they host common
     24   scientific datasets, and you can use EC2 to compute against them.
     25
     26BOINC has some features that may be useful in these scenarios:
     27e.g., locality scheduling and sticky files.
     28It lacks some features that may be needed:
     29e.g., awareness of client proximity,
     30or the ability to transfer files directly between clients.
    731
    832== Virtualizing volunteer computing ==