TechCrunch Minute: How Anthropic found a trick to get AI to give you answers it’s not supposed to
If you build it, people will try to break it. Sometimes even the people building stuff are the ones breaking it. Such is the case with Anthropic and its latest research which demonstrates an interesting vulnerability in current LLM technology. More or less if you keep at a question, you can break guardrails and wind up […]
© 2024 TechCrunch. All rights reserved. For personal use only.