[ICCD'22] HyFarM: Task Orchestration on Hybrid Far Memory for High Performance Per Bit --Jing Wang, Chao Li, et al.
[DAC'22] SALO: An Efficient Spatial Accelerator Enabling Hybrid Sparse Attention Mechanisms for Long Sequences --Guan Shen, Jieru Zhao, et al.
[ICPE'22] Oversubscribing GPU Unified Virtual Memory: Implications and Suggestions --Chuanming Shao, Jinyang Guo, et al.
[SoCC'22] Characterizing and Orchestrating VM Reservation in Geo-distributed Clouds to Improve the Resource Ef... --Jiuchen Shi, Kaihua Fu, et al.
[SC'22] QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services --Kaihua Fu, Jiuchen Shi, et al.
[ATC'22] DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on... --Weihao Cui, Han Zhao, et al.
[ATC'22] PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training --Wei Zhang, Binghao Chen, et al.
[ATC'22] RunD: A Lightweight Secure Container Runtime for High-density Deployment and High-concurrency Startup... --Zijun Li, Jiagan Cheng, et al.
[ATC'22] Help Rather Than Recycle: Alleviating Cold Startup in Serverless Computing Through Inter-Function Con... --Zijun Li, Linsong Guo, et al.
[ICS'22] PAME: Precision-Aware Multi-Exit DNN Serving for Reducing Latencies of Batched Inferences --Shulai Zhang, Weihao Cui, et al.