Giraffe extends Meta's LLaMA context window to dozens of pages

Midjourney prompted by THE DECODER

Giraffe is an open-source LLM with a context window of 32,000 tokens, making it useful for many applications in business contexts.

Large language models like GPT-4 show impressive capabilities but often have a limited context window, which limits their use in tasks where they would have to process dozens of pages. Variants such as GPT-4-32k or Anthropic's Claude with a context window of 100,000 tokens provide a much larger "memory" and are therefore more powerful in such use cases.

Now, researchers have extended the context window of the open-source LLaMA model by up to 10 times using interpolation techniques that reach about 32,000 tokens. The resulting LLM, called Giraffe, comes in a 13 billion parameter version and has one of the largest context windows of any open-source LLM.

Open Source Giraffe provides insight into scaling context windows

Being open-source, the research also provides some important insights into the inner workings of LLMs and different scaling techniques for enlarging the context window. According to the Abacus.AI team, Liner scaling of position embeddings was the most effective at increasing context length, with others also having some effect.

They also found that accuracy on long context tasks decreased with increasing length, demonstrating the limitations of current techniques, and showed that perplexity, commonly used to measure LLM performance, alone is insufficient to measure long context performance, highlighting the need for custom testing.

More information and data are available on the project GitHub, with the Giraffe-v2-13b-32k model being hosted on Hugging Face.

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Giraffe extends Meta's LLaMA context window to dozens of pages

Open Source Giraffe provides insight into scaling context windows

New Google model to autonomously control browsers and mobile apps

Google expands AI-powered search mode to Europe

Elevenlabs releases open-source UI library for voice and audio applications

OpenAI suddenly remembers that copyright law exists after a few days of wild Sora videos

OpenAI unveils Sora 2 video model with realistic physics, high-quality audio, and a new social app

Deepmind says video models for visual tasks could become what LLMs are for text tasks

Giraffe extends Meta's LLaMA context window to dozens of pages

Open Source Giraffe provides insight into scaling context windows

New Google model to autonomously control browsers and mobile apps

Google expands AI-powered search mode to Europe

Elevenlabs releases open-source UI library for voice and audio applications