Security researchers caution against potential exploits in Google Gemini. The report cites risks of disclosing sensitive information, generating misinformation, and leaking data.
According to findings by cybersecurity firm HiddenLayer, hackers could exploit flaws in Gemini Advanced and its integration with Google Workspace or the Gemini API.
The first vulnerability allows tricking Gemini into revealing system prompts, including sensitive data like passwords, through strategic questioning.
Researchers also uncovered the potential for “crafty jailbreaking,” enabling Gemini to generate misinformation and malicious content, posing risks such as spreading fake news during events like elections.
Additionally, Gemini can be manipulated to leak information in system prompts by inputting repeated uncommon tokens, exploiting the model’s response mechanism.
Google acknowledged these vulnerabilities in a comment to Hacker News. However, it emphasized ongoing efforts to enhance model defenses through red-teaming exercises and safeguard implementations.
Gemini Ultra, the flagship model in the Gemini lineup, boasts advanced capabilities like plugin support, video parsing, and complex reasoning, positioning it to rival OpenAI’s GPT-4.