-
DSpark: Speculative decoding을 활용한 LLM 추론 가속화
DSpark: Confidence Scheduled Speculative Decoding with Semi Autoregressive Generation 논문 : DeepSeek AI & Peking University PDF : https://github.com/deepseek ai/DeepSpec/blob/main/D...
#inference-optimization
DSpark: Confidence Scheduled Speculative Decoding with Semi Autoregressive Generation 논문 : DeepSeek AI & Peking University PDF : https://github.com/deepseek ai/DeepSpec/blob/main/D...