
Matthias Bastian

Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.
Researchers uncover automated jailbreak attacks on LLMs like ChatGPT or Bard

Researchers have discovered that it is possible to automatically construct adversarial attacks that trick large language models (LLMs) such as ChatGPT, Bard, and Claude into producing unintended and potentially harmful content. Traditional jailbreaks require significant manual effort to develop and can usually be patched by LLM vendors. These automated attacks, however, can be generated in large numbers and work against both closed-source and openly available chatbots.

Similar adversarial attacks have existed in computer vision for over a decade, suggesting that such threats may be inherent to AI systems. More worryingly, the research suggests that it may not be possible to prevent these attacks entirely. As society becomes more dependent on AI technology, these risks should be taken into account. Perhaps the best we can do is try to use AI in the most positive way possible.

Big AI unites in Frontier Model Forum to focus on safe AI progress

Microsoft, Anthropic, Google, and OpenAI have launched the Frontier Model Forum, an industry body to promote the safe and responsible development of advanced AI models. The organization will focus on AI safety research, sharing best practices, collaborating with stakeholders, and supporting applications that address societal challenges. The Forum is open to other developers working on "frontier models as defined by the Forum" that share its commitment to safety.