Synthesizing Representative I/O Workloads Using Iterative Distillation

View/ Open
Date
2003Author
Kurmas, Zachary Alan
Keeton, Kimberly
Mackenzie, Kenneth M.
Metadata
Show full item recordAbstract
Storage systems designers are still searching for better methods of obtaining representative I/O workloads to drive studies of I/O systems. Traces of production workloads are very accurate, but inflexible and difficult to obtain. (Privacy and performance concerns discourage most system administrators from collecting such traces and making them available to the public.) The use of synthetic workloads addresses these limitations; however, synthetic workloads are accurate only if they share certain key properties with the production workload on which they are based (e.g., mean request size, read percentage). Unfortunately, we do not know which properties are "key" for a given
workload and storage system.
We have developed a tool, the Distiller, that automatically identifies the key properties (more formally called attribute-values) of the workload. These attribute-values can then be used to generate a synthetic workload representative of the production workload. This paper presents the design and evaluation of the Distiller. We demonstrate how the Distiller finds representative synthetic workloads for simple artificial workloads and three production workload traces.
Collections
- CERCS Technical Reports [193]
Related items
Showing items related by title, author, creator and subject.
-
Generating and Analyzing Synthetic Workloads using Iterative Distillation
Kurmas, Zachary Alan (Georgia Institute of Technology, 2004-05-14)The exponential growth in computing capability and use has produced a high demand for large, high-performance storage systems. Unfortunately, advances in storage system research have been limited by (1) a lack of ... -
Adaptive and Automated Index Selection in Relational DBMS
Frank, Martin Robert; Omiecinski, Edward Robert; Navathe, Shamkant B. (Georgia Institute of Technology, 1994)We present a novel approach for a tool that assists the database administrator in designing an index configuration for a relational database system. A new methodology for collecting usage statistics at run time is ... -
The Virtual Time Machine
Fujimoto, R. M. (Georgia Institute of Technology, 1994)