The Interaction of Failure and Performance in a Migratory File Service
| dc.contributor.author | Bent, John | en_US |
| dc.contributor.author | Thain, Douglas | en_US |
| dc.contributor.author | Arpaci-Dusseau, Andrea | en_US |
| dc.contributor.author | Arpaci-Dusseau, Remzi | en_US |
| dc.contributor.author | Livny, Miron | en_US |
| dc.date.accessioned | 2012-03-15T17:17:07Z | |
| dc.date.available | 2012-03-15T17:17:07Z | |
| dc.date.created | 2003 | en_US |
| dc.date.issued | 2003 | |
| dc.description.abstract | We present the design, implemetitation, and evaluation of a Migratory File Service (MFS), a system designed to exploit semantic knowledge of workloads and user expectations to improve performance and handle failures effectively in wide-area batch scheduling systems. We discuss Hawk, a prototype MFS system which has two novel components: migratory proxies, which cache data at remote clusters, and a workflow manager, which manages the workflow of the system. Hawk integrates aggressive caching and I/O filtering to reduce wide-area traffic, proactively replicates data to avoid regeneration due to failure, and performs fine-grained rollback and recovery to minimize the effort required to recover from failure. Through a case study of data-intensive applications, we demonstrate the benefits of Hawk over traditional approaches, delivering a two to three orders of magnitude increase in performance for jobs that are deployed across a wide-area batch scheduling environment. | en_US |
| dc.format.mimetype | application/pdf | en_US |
| dc.identifier.citation | TR1475 | en_US |
| dc.identifier.uri | http://digital.library.wisc.edu/1793/60348 | |
| dc.publisher | University of Wisconsin-Madison Department of Computer Sciences | en_US |
| dc.title | The Interaction of Failure and Performance in a Migratory File Service | en_US |
| dc.type | Technical Report | en_US |
Files
Original bundle
1 - 1 of 1