Jinho Lee, Jongwook Chung, et al.
IEEE Transactions on VLSI Systems
Today's common practice in developing conversational agents is pipelining off-the-shelf modularized services as ready-made building blocks. However, the discrete and sequential nature of the modules yields long response latency. We introduce Sci-Fii, a speculative inference framework accelerating conversational agent systems built with off-the-shelf modules, while keeping the modules unchanged.
Jinho Lee, Jongwook Chung, et al.
IEEE Transactions on VLSI Systems
Chungkuk Yoo, Inseok Hwang, et al.
MobiSys 2017
Bumsoo Kang, Inseok Hwang, et al.
MobiSys 2018
Bumsoo Kang, Inseok Hwang, et al.
MobiSys 2018