#Layer Skipping

Self-Speculative Decoding: An Innovative Method for Accelerating Large Language Model Inference

2 months ago