Researchers have released a preview of LongLLaMA, a large language model capable of handling long contexts of up to 256,000 tokens or more. It is built on the open-source OpenLLaMA model and fine-tuned with the Focused Transformer (FoT) method, which lets selected attention layers access a memory cache of key-value pairs to extend the effective context length.
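The core idea can be illustrated with a minimal sketch: queries attend not only over the current context but also over an external cache of keys and values from earlier text. This is a simplified illustration of the general mechanism, not the FoT code itself; all names and shapes are hypothetical.

```python
import torch
import torch.nn.functional as F

def attention_with_memory(q, k_local, v_local, k_mem, v_mem):
    """Attention where queries see both the local context and a
    memory cache of (key, value) pairs from earlier tokens."""
    # Prepend the cached keys/values to the local ones.
    k = torch.cat([k_mem, k_local], dim=1)   # (batch, mem_len + ctx_len, dim)
    v = torch.cat([v_mem, v_local], dim=1)
    scores = q @ k.transpose(-2, -1) / (q.shape[-1] ** 0.5)
    weights = F.softmax(scores, dim=-1)
    return weights @ v                        # (batch, ctx_len, dim)

# Toy usage with random tensors: 8 current tokens, 1,024 cached tokens.
q = torch.randn(1, 8, 64)
k_loc, v_loc = torch.randn(1, 8, 64), torch.randn(1, 8, 64)
k_mem, v_mem = torch.randn(1, 1024, 64), torch.randn(1, 1024, 64)
out = attention_with_memory(q, k_loc, v_loc, k_mem, v_mem)  # shape (1, 8, 64)
```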


According to the researchers, the model retains performance on tasks that don't require long contexts, and can be used as a drop-in replacement for shorter context LLaMA implementations. The team has released their smaller 3B variant under the Apache 2.0 license, with inference code supporting longer contexts on Hugging Face. More information and examples of LongLLaMA can be found on their GitHub repository.
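Because the released checkpoint targets the standard Hugging Face tooling, loading it should look roughly like the sketch below. The model identifier and generation settings are assumptions for illustration, not taken from the article; check the project's repository for the exact instructions.

```python
import torch
from transformers import AutoModelForCausalLM, LlamaTokenizer

# Hypothetical model identifier; replace with the one from the project's repo.
MODEL_ID = "syzymon/long_llama_3b"

tokenizer = LlamaTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.float32,
    trust_remote_code=True,  # the long-context inference code ships with the checkpoint
)

prompt = "My favourite animal is"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(inputs.input_ids, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```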
