4x faster LLM inference (Flash Attention guy's company)

Wait 5 sec.

Article URL: https://www.together.ai/blog/adaptive-learning-speculator-system-atlasComments URL: https://news.ycombinator.com/item?id=45556474Points: 15# Comments: 0