
Giraffe is an open-source LLM with a context window of 32,000 tokens, making it useful for many applications in business contexts.

Large language models like GPT-4 show impressive capabilities but often have a small context window, which restricts their use in tasks that require processing dozens of pages. Variants such as GPT-4-32k or Anthropic's Claude, with a context window of 100,000 tokens, offer a much larger "memory" and are therefore better suited to such use cases.

Now, researchers have extended the context window of the open-source LLaMA model by up to 10 times using interpolation techniques, reaching about 32,000 tokens. The resulting LLM, called Giraffe, comes in a 13-billion-parameter version and has one of the largest context windows of any open-source LLM.

Open Source Giraffe provides insight into scaling context windows

Because the model and research are open source, the work also provides important insights into the inner workings of LLMs and into different techniques for enlarging the context window. According to the Abacus.AI team, linear scaling of position embeddings was the most effective at increasing context length, with other methods also having some effect.
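The idea behind linear scaling can be sketched in a few lines: position indices are divided by a scale factor so that a long sequence is squeezed back into the position range the model saw during pretraining. Below is a minimal, illustrative sketch for rotary position embeddings (RoPE), the scheme LLaMA uses; the function name, the head dimension, and the 2,048-to-32,768 scale factor are assumptions for demonstration, not details from the paper.

```python
import numpy as np

def rope_angles(positions, dim=64, base=10000.0, scale=1.0):
    """Rotation angles for rotary position embeddings (RoPE).

    Linear position interpolation divides each position index by
    `scale`, mapping a longer sequence into the position range the
    model was trained on.
    """
    inv_freq = 1.0 / (base ** (np.arange(0, dim, 2) / dim))  # per-pair frequencies
    pos = np.asarray(positions, dtype=np.float64) / scale    # interpolated positions
    return np.outer(pos, inv_freq)                           # (seq_len, dim/2) angles

# Hypothetical setup: model pretrained on 2,048 positions, extended 16x.
angles_short = rope_angles(range(2048))             # no interpolation
angles_long  = rope_angles(range(32768), scale=16)  # interpolated

# With scale 16, position 16 lands on the same angles as position 1
# did without scaling, so the model never sees out-of-range positions.
assert np.allclose(angles_long[16], angles_short[1])
```

The trade-off, as the article notes, is that squeezing positions together costs resolution, which is one reason accuracy degrades at longer lengths.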


They also found that accuracy on long-context tasks decreased with increasing length, demonstrating the limitations of current techniques. In addition, they showed that perplexity, a common measure of LLM performance, is insufficient on its own to assess long-context performance, highlighting the need for custom tests.

More information and data are available on the project GitHub, with the Giraffe-v2-13b-32k model being hosted on Hugging Face.

Join our community
Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.
Summary
  • Researchers extended the context window of the open-source LLaMA model, creating Giraffe, a large language model (LLM) with a context window of 32,000 tokens that's useful for various business applications.
  • Giraffe, which comes in a 13 billion parameter version, provides important insights into LLMs and different scaling techniques for enlarging the context window. Linear scaling of position embeddings was found to be the most effective.
  • The research also highlights limitations of current techniques and the need for custom testing since perplexity alone is insufficient to measure long context performance. Giraffe-v2-13b-32k is available on Hugging Face and the project GitHub.
Max is managing editor at THE DECODER. As a trained philosopher, he deals with consciousness, AI, and the question of whether machines can really think or just pretend to.