The Interaction of Failure and Performance in a Migratory File Service

dc.contributor.authorBent, Johnen_US
dc.contributor.authorThain, Douglasen_US
dc.contributor.authorArpaci-Dusseau, Andreaen_US
dc.contributor.authorArpaci-Dusseau, Remzien_US
dc.contributor.authorLivny, Mironen_US
dc.date.accessioned2012-03-15T17:17:07Z
dc.date.available2012-03-15T17:17:07Z
dc.date.created2003en_US
dc.date.issued2003
dc.description.abstractWe present the design, implemetitation, and evaluation of a Migratory File Service (MFS), a system designed to exploit semantic knowledge of workloads and user expectations to improve performance and handle failures effectively in wide-area batch scheduling systems. We discuss Hawk, a prototype MFS system which has two novel components: migratory proxies, which cache data at remote clusters, and a workflow manager, which manages the workflow of the system. Hawk integrates aggressive caching and I/O filtering to reduce wide-area traffic, proactively replicates data to avoid regeneration due to failure, and performs fine-grained rollback and recovery to minimize the effort required to recover from failure. Through a case study of data-intensive applications, we demonstrate the benefits of Hawk over traditional approaches, delivering a two to three orders of magnitude increase in performance for jobs that are deployed across a wide-area batch scheduling environment.en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationTR1475en_US
dc.identifier.urihttp://digital.library.wisc.edu/1793/60348
dc.publisherUniversity of Wisconsin-Madison Department of Computer Sciencesen_US
dc.titleThe Interaction of Failure and Performance in a Migratory File Serviceen_US
dc.typeTechnical Reporten_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR1475.pdf
Size:
2.6 MB
Format:
Adobe Portable Document Format