{"@context":"http://iiif.io/api/presentation/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/manifest.json","@type":"sc:Manifest","label":"Multi-dimensional Workload Analysis and Synthesis for Modern Storage Systems","metadata":[{"label":"dc.description.sponsorship","value":"This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree."},{"label":"dc.format","value":"Monograph"},{"label":"dc.format.medium","value":"Electronic Resource"},{"label":"dc.identifier.uri","value":"http://hdl.handle.net/11401/77315"},{"label":"dc.language.iso","value":"en_US"},{"label":"dc.publisher","value":"The Graduate School, Stony Brook University: Stony Brook, NY."},{"label":"dcterms.abstract","value":"Modern computer systems produce and process an overwhelming amount of data at an increasing rates. The performance of storage hardware components, however, cannot keep up with the required speed at a practical cost. To mitigate this discrepancy, storage vendors incorporate many workload-driven optimizations in their products. Emerging applications cause workload patterns to change rapidly and significantly. One of the prominent examples is a rapid shift towards virtualized environments. Virtual machines mix I/O streams from different applications and perturb applications' access patterns. In addition, modern users demand more convenience features, such as deduplication, snapshotting, and encryption. Stringent performance requirements, changing I/O patterns, and the growing feature list increase the complexity of storage systems. The complexity of design, in turn, makes the evaluation of the storage systems a difficult task. Storage community needs practical evaluation tools and techniques to resolve this task timely and efficiently. This thesis first explores the complexity of evaluating storage systems. Second, the thesis proposes a Multi-Dimensional Histogram (MDH) workload analysis as a basis for designing a variety of evaluation tools. I/O traces are good sources of information about real-world workloads but are inflexible in representing more than the exact system conditions at the point the traces were captured. We demonstrate how MDH techniques can accurately convert I/O traces to workload models. Historically, most I/O optimizations focused on the metadata: e.g., I/O access patterns, arrival times, read/write sizes. Increasingly, storage systems must also consider the data and not just the metadata. For example, deduplication systems eliminate duplicates in the data to increase logical storage capacity. We use MDH techniques to generate realistic datasets for deduplication systems. The shift from physical to virtual clients drastically changes the I/O workloads seen by Network Attached Storage (NAS). Using MDH techniques we study workload changes caused by virtualization and synthesize a set of versatile NAS benchmarks. It is our thesis that MDH techniques are powerful for both workload analysis and synthesis. MDH analysis bridges the gap between the complexity of storage systems and the availability of practical evaluations tools."},{"label":"dcterms.available","value":"2017-09-20T16:52:29Z"},{"label":"dcterms.contributor","value":"Kuenning, Geoffrey"},{"label":"dcterms.creator","value":"Tarasov, Vasily"},{"label":"dcterms.dateAccepted","value":"2017-09-20T16:52:29Z"},{"label":"dcterms.dateSubmitted","value":"2017-09-20T16:52:29Z"},{"label":"dcterms.description","value":"Department of Computer Science."},{"label":"dcterms.extent","value":"94 pg."},{"label":"dcterms.format","value":"Monograph"},{"label":"dcterms.identifier","value":"http://hdl.handle.net/11401/77315"},{"label":"dcterms.issued","value":"2013-12-01"},{"label":"dcterms.language","value":"en_US"},{"label":"dcterms.provenance","value":"Made available in DSpace on 2017-09-20T16:52:29Z (GMT). No. of bitstreams: 1\nTarasov_grad.sunysb_0771E_11602.pdf: 899235 bytes, checksum: 5dcd6e4f08e2e877f870b169b2da38cf (MD5)\n Previous issue date: 1"},{"label":"dcterms.publisher","value":"The Graduate School, Stony Brook University: Stony Brook, NY."},{"label":"dcterms.subject","value":"Benchmarking, Characterization, Performance, Storage, Workload"},{"label":"dcterms.title","value":"Multi-dimensional Workload Analysis and Synthesis for Modern Storage Systems"},{"label":"dcterms.type","value":"Dissertation"},{"label":"dc.type","value":"Dissertation"}],"description":"This manifest was generated dynamically","viewingDirection":"left-to-right","sequences":[{"@type":"sc:Sequence","canvases":[{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json","@type":"sc:Canvas","label":"Page 1","height":1650,"width":1275,"images":[{"@type":"oa:Annotation","motivation":"sc:painting","resource":{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/96%2F71%2F26%2F96712681378381670355624095691405893727/full/full/0/default.jpg","@type":"dctypes:Image","format":"image/jpeg","height":1650,"width":1275,"service":{"@context":"http://iiif.io/api/image/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/96%2F71%2F26%2F96712681378381670355624095691405893727","profile":"http://iiif.io/api/image/2/level2.json"}},"on":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json"}]}]}]}