
Researchers from Tsinghua University and Zhipu AI have introduced LongWriter, and specifically the model LongWriter-glm4-9b, which is designed to generate long-form text exceeding 10,000 words at once. The work addresses a limitation of current long-context large language models (LLMs), which can process inputs of up to 100,000 tokens but typically struggle to produce outputs longer than 2,000 words. The approach uses an agent-based pipeline to build a dataset created specifically for long-output training, and some of the fine-tuned models can be run locally. The accompanying paper, from THUDM, evaluates the models with metrics judged by GPT-4.
Existing long context LLM already possesses the potential for a larger output window–all you need is data with extended output during model alignment to unlock this capability. Problem 😕: LLMs struggle to generate outputs exceeding 2,000 words, despite handling inputs over… https://t.co/rqahHFov6y
LongWriter-glm4-9b from @thukeg is capable of generating 10,000+ words at once!🚀 Paper identifies a problem with current long context LLMs -- they can process inputs up to 100,000 tokens, yet struggle to generate outputs exceeding lengths of 2,000 words. Paper proposes that an… https://t.co/2jfKyIpShK
New from @thukeg: "LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs". Author @realYushiBai is active in the discussion section to answer your questions: https://t.co/UeebckjJjf https://t.co/FfOczYwz4G

