Show simple item record

dc.contributor.advisorKrishna, Tushar
dc.contributor.authorKwon, Hyouk Jun
dc.date.accessioned2020-09-08T12:47:53Z
dc.date.available2020-09-08T12:47:53Z
dc.date.created2020-08
dc.date.issued2020-07-23
dc.date.submittedAugust 2020
dc.identifier.urihttp://hdl.handle.net/1853/63663
dc.description.abstractDeep neural network (DNN) accelerators, which are specialized hardware for DNN inferences, enabled energy-efficient and low-latency DNN inferences. To maximize the efficiency (energy efficiency, latency, and throughput) of DNN accelerators, DNN accelerator designers optimize DNN accelerator and mapping of target DNN models on the accelerator. However, designing DNN accelerators for recent DNN models that contain diverse layer operations and size is challenging since optimizing accelerator and mapping for the average case of the layers in target DNN workloads often lead to uniformly inefficient design points. Therefore, this thesis proposes to design flexible mapping DNN accelerators that can run multiple mappings to adapt to diverse DNN layers in DNN workloads. This thesis first quantifies the costs and benefits of mapping using a data-centric approach. Based on the observation that no single mapping is ideal for all layers, this thesis explores two approaches to design flexible mapping accelerators: reconfigurability and heterogeneity. Reconfigurable accelerators are based on communication-centric approach that implements flexible network-on-chip (NoC) to enable to configure accelerator at runtime for any mapping styles. Heterogeneous accelerators employ multiple sub-accelerators with fixed but diverse mapping styles within an accelerator chip to provide coarser-grained flexibility with lower area and power cost than the reconfigurability. Case studies show that both approaches provide Pareto-optimal design points with different strengths.
dc.format.mimetypeapplication/pdf
dc.language.isoen_US
dc.publisherGeorgia Institute of Technology
dc.subjectDNN accelerator
dc.subjectDNN dataflow
dc.subjectDNN mapping
dc.subjectFlexible mapping accelerator
dc.titleData- and communication-centric approaches to model and design flexible deep neural network accelerators
dc.typeDissertation
dc.description.degreePh.D.
dc.contributor.departmentComputer Science
thesis.degree.levelDoctoral
dc.contributor.committeeMemberPellauer, Michael
dc.contributor.committeeMemberSarkar, Vivek
dc.contributor.committeeMemberKim, Hyesoon
dc.contributor.committeeMemberTumanov, Alexey
dc.date.updated2020-09-08T12:47:53Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record