当前位置:首 页 ->
论文发表 ->
Conference [ICCD'22] HyFarM: Task Orchestration on Hybrid Far Memory for High Performance Per Bit --Jing Wang, Chao Li, et al. [DAC'22] SALO: An Efficient Spatial Accelerator Enabling Hybrid Sparse Attention Mechanisms for Long Sequences --Guan Shen, Jieru Zhao, et al. [ICPE'22] Oversubscribing GPU Unified Virtual Memory: Implications and Suggestions. --Chuanming Shao, Jinyang Guo, et al. [SoCC'22] Characterizing and Orchestrating VM Reservation in Geo-distributed Clouds to Improve the Resource Ef... --Jiuchen Shi, Kaihua Fu, et al. [SC'22] QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services. --Kaihua Fu, Jiuchen Shi, et al. [ATC'22] DVABatch: Diversity-aware Multi-Entry Multi-Exit Batching for Efficient Processing of DNN Services on... --Weihao Cui, Han Zhao, et al. [ATC'22] PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training. --Wei Zhang, Binghao Chen, et al. [ATC'22] RunD: A Lightweight Secure Container Runtime for High-density Deployment and High-concurrency Startup... --Zijun Li, Jiagan Cheng, et al. [ATC'22] Help Rather Than Recycle: Alleviating Cold Startup in Serverless Computing Through Inter-Function Con... --Zijun Li, Linsong Guo, et al. [ICS'22] PAME: Precision-Aware Multi-Exit DNN Serving for Reducing Latencies of Batched Inferences. --Shulai Zhang, Weihao Cui, et al.