Microsoft Engineer Raises Concerns Over AI Image Generator’s Security

A Microsoft AI engineer, Shane Jones, has brought attention to potential security flaws in OpenAI鈥檚 DALL-E 3 model, used in Microsoft鈥檚 Designer AI image creator.

Jones claims to have discovered a vulnerability in early December that allowed users to bypass safety guardrails, leading to the creation of explicit and violent deepfake images, including those of singer Taylor Swift.

Jones expressed his worries by sending a letter to Washington State鈥檚 Attorney General and US senators, alleging that Microsoft downplayed the severity of the flaws in DALL-E 3.

Microsoft, in response, has stated that the reported techniques did not breach their safety filters, and they are addressing any remaining concerns with the concerned employee.

Allegations of Downplaying and Microsoft鈥檚 Response

Jones, in his letter, contends that Microsoft was aware of the vulnerabilities and the potential for misuse but did not adequately address the issues. He further claims that, after reporting the matter to Microsoft, he was instructed to send the details to OpenAI, the technology鈥檚 developer.

Even though he attempted to bring attention to the flaws, Jones asserts that he did not receive a response from either Microsoft or OpenAI.

In response to these allegations, Microsoft stated that they encouraged the employee to report through OpenAI鈥檚 channels and that they investigated the concerns raised.

An OpenAI spokesperson affirmed that the reported technique did not bypass their safety systems, and they have implemented additional safeguards for their products, including declining requests that ask for a public figure by name.

Taylor Swift Deepfake Incident

The explicit deepfake images of Taylor Swift, allegedly generated using Microsoft鈥檚 Designer AI and OpenAI鈥檚 DALL-E 3, have ignited concerns about the potential misuse of AI in creating harmful content.

Jones points to the vulnerabilities in DALL-E 3 and similar products as posing a risk to public safety, especially with the capacity to generate disturbing images.

Microsoft CEO Satya Nadella, when asked about the Taylor Swift deepfakes, expressed concern, stating, 鈥渨e have to act.鈥 The company, responding to the emergence of these deepfakes, reinforced its commitment to providing a safe and respectful experience for users.

Silencing Concerns and Calls for Government Intervention

Shane Jones claims that Microsoft鈥檚 legal department demanded the removal of his public letter urging OpenAI to address the DALL-E 3 vulnerabilities. Despite his willingness to assist in fixing the specific vulnerability, Jones alleges that Microsoft鈥檚 legal team did not respond or communicate directly with him.

In his letter to Washington State鈥檚 Attorney General and US representatives, Jones advocates for the creation of a government system to report and track AI-related issues.

He emphasises the importance of ensuring that employees can raise concerns without fear of retaliation and suggests that companies developing AI products should be held accountable for disclosing known risks.