OpenAI Adopts Google's SynthID Watermark for AI Images with Verification Tool (openai.com)
219 points by smooke 8 hours ago
himata4113 5 hours ago
if you tell it to generate the AI image with a black background you can visually see the synthid with a good enough monitor, it's just a repeating fuzzy pattern, nothing special.
I have found great success of getting rid of it by masking every 2nd pixel, regenerating missing pixels and then once again masking every 2nd pixel offset by 1.
Used an off the shelf model to fill in the pixels, but I also exported a depthmap first (before any alternations) and denoised it so generated masked pixels comform to the original content. The result was obviously not 100% perfect, but with more time and a model fine tuned for this specific use-case would be able to remove any kind of ai watermarking without too many issues.
cryptoegorophy 9 minutes ago
Can an image just be stretched or compressed a very tiny bit?
teravor 4 hours ago
i wouldn't have any confidence in being able to remove a 0.5 bit watermark (presence/absence). what you see is probably a functional decoy.
userbinator 3 hours ago
but with more time and a model fine tuned for this specific use-case would be able to remove any kind of ai watermarking without too many issues.
Always amusing to see AI used against itself.
m00dy 42 minutes ago
It’s definitely hackable, Some of our engineers worked on this long time ago
https://deepwalker.xyz/blog/bypassing-synthid-in-gemini-phot...
tantalor 4 hours ago
But why
noir_lord 4 hours ago
Because we are on Hacker News.
tantalor 4 hours ago
big_toast 7 hours ago
What information is included in the metadata or SynthID? How many bits can be encoded in a SynthID?
Can it be used to create something like nutritional labels for synthetic content? 10% synthetic text, 30 synthetic images.
Your reality was 15% synthetic today (75% mega corp, 25% open-weight neocloud).
big_toast 6 hours ago
I guess the SynthID-Image paper from Oct 2025[0] was an encoder-decoder for which they tested checking a flag or a 136 bit payload in 512x512 images and the watermark's robustness after various transformations.
Presumably the deployed version is meaningfully different.
echelon 5 hours ago
This is very similar to audiowmark
https://github.com/swesterfeld/audiowmark
You can stuff per-item database unique IDs, user IDs, geohashes, and other nefarious things inside.
We need to protest this LOUDLY.
Our devices are being locked down, we're having attestation and trusted computing forced on us, the internet all over the world is undergoing age verification with full ID verification.
Just because this is on "ai images" today doesn't mean it won't be on all images - screenshots, your camera reel, etc. - in the fullness of time.
This is scary.
These are the tools of 1984. They've been boiling the water slowly, but in the last year things have really started to pick up pace. Please push back. Loudly.
Everyone at Google and OpenAI working on this: WHAT THE FUCK ARE YOU DOING. STOP.
We have laws and mechanisms to prevent revenge porn, CSAM, defamation, etc. They are robust and can be made even stronger. We do not need to sacrifice the security of our privacy and our speech to fight imagined harms when the real danger is turning into an authoritarian society.
tadfisher 3 hours ago
Dylan16807 an hour ago
Extropy_ 5 hours ago
gumby271 3 hours ago
Or perhaps a user id or fingerprint to an individual. We added that to printers long ago, this would easily enable that for every photo and image you generate too.
janalsncm 5 hours ago
Don’t think that would be possible. If I paste a synthetic piece into an otherwise organic image, the synth id isn’t going to know that.
animal_spirits 5 hours ago
Synth ID can detect parts of images with the watermark.
WhatIsDukkha 6 hours ago
This is just performative nonsense.
As someone that creates things with tools with different media I would just hard avoid this tool that adds...
arbitrary metadata not of my choosing.
Should I seriously make a texture for a videogame with this weird DRM glorp in it?
How old is photoshop and why is it exempt?
ericpruitt 6 hours ago
Just because something isn't perfect doesn't mean it's not useful. I've already seen posts online that were able to be proven as falsified because someone ran the images through Google for SynthID checks.
> How old is photoshop and why is it exempt?
For one, it's not developed by Google or OpenAI. The barrier to entry to making realistic but deceptive images with Photoshop is far higher than with AI, and there are already techniques that can, imperfectly, be used to detect the use of traditional image editing.
WhatIsDukkha 4 hours ago
So 999 people that are just making an image need to be DRM'ed so that you might catch the 1 person making "realistic but deceptive" images... like this is some kind of special case of ... internet images.
space_fountain 4 hours ago
nathancahill 2 hours ago
I mean I see a lot of images online where people forget or don't care enough to remove/crop the Gemini watermark.
bsder 3 hours ago
I guarantee this works poorly, at best.
If this actually works solidly, Google is in deep, deep, deep shit. It would mean that I can put a mark on my non-AI videos and demand that Google not allow upload of my identifiably copyrighted content.
This would completely obliterate YouTube.
fc417fc802 an hour ago
Jtarii 6 hours ago
>How old is photoshop and why is it exempt?
I'm sure you can think of a couple things that differentiate gen AI from photoshop, I believe in you.
WhatIsDukkha 5 hours ago
The main difference is we are in the middle of a moral panic and people have lost perspective.
Its a tool with different modalties and affordances.
surgical_fire 4 hours ago
janalsncm 5 hours ago
Strictly speaking, DRM = digital rights management, which is related to intellectual property.
SynthID would only be DRM if Google/OpenAI were claiming IP rights over their images. I don’t even know if that’s legal though.
WhatIsDukkha 4 hours ago
What value does "strictly speaking" bring to the discussion?
So that you don't have to address any of the issues?
janalsncm 3 hours ago
dylan604 3 hours ago
drdeca 3 hours ago
runarberg 4 hours ago
For the record: https://en.wikipedia.org/wiki/Printer_tracking_dots
Barbing 6 hours ago
> How old is photoshop and why is it exempt?
How does today’s maximum theoretical disinformation output per minute compare to 2021 Photoshop?
WhatIsDukkha 4 hours ago
Its 2026... people are deliberately choosing to live in their own realities with no care about objective facts or moral choices.
So weird images are a big problem? No they don't matter at all.
Barbing 4 hours ago
surgical_fire 4 hours ago
doctorpangloss 3 hours ago
You: "performative nonsense! Arbitrary metadata not of my choosing!"
Also you: well, games go through some kind of distribution, which has plenty of telemetry and metadata. Whether it is App Store with notarization, or Steam or Itch who collect analytics and know a lot about you, or your ISP if you self host your eclectic WebGL game from home. Posting on an iPhone or Android phone, to hacker News which has your email address, on your cell network which has IPv6 globally unique addresses...
"But my choosing!" You'll say. It is extremely performative of you to say, "everything that would make me 200% wrong isn't valid."
I don't know. I really hate these vibes-driven reactions to (checks notes) content attribution. Every accusation is a confession in this frame of mind. How do you not see that?
WhatIsDukkha 2 hours ago
You are asserting that the existence of metadata in other venues to be proof that this form of watermarking metadata is just fine with you and should be for everyone else because... nope don't see any reason listed here.
I have an IP address so therefore this is all fine?
"Every accusation is a confession" also seems like an insinuation that I have something to hide but you have "nothing to hide, nothing to fear"ie the very generic privacy right fallacy.
As for "vibes driven"... this whole technical "fix" is a result of the reactionary "vibe" of the ai moral panic, your "notes" don't seem to be providing any perspective there?
keyle 22 minutes ago
Is it like metadata in mp3?
If I take a screenshot of an AI image, will that then be seen as an AI image? Is that 'hidden in the image' or as metadata?
userbinator 2 hours ago
Currently, this article is conveniently right next to it: https://news.ycombinator.com/item?id=48200569
amazingamazing 8 hours ago
Good. Despite people saying it will be removed, I have seen no reproducible repo demonstrating it.
raincole 7 hours ago
Stable Diffusion with 10%~15% denoising strength. Done.
I tested the day 1 when Nano Banana Pro was released and it worked. It still works today for Nano Banana 2.
I didn't post this anywhere because I (arrogantly) thought saying it publicly would make the internet worse. But it was pure arrogancy: if I came up with this the first day then of course other millions of programmers did too.
That being said, it'll introduce the typical artifacts from SD models and that might be detected by other methods (or just by zooming in a lot and looking carefully).
vunderba 7 hours ago
Yup, OOC a while back I put together a ComfyUI node that took in a NB image and start with the smallest amount of denoise strength using Flux.1 (but works with any model), then run img2img with a synthid check incrementing denoise in a loop until it was defeated.
Never released it, but it was obvious to most people in the SD community that denoising using a diffusion model was a relatively trivial means to beat most steganographic watermarks.
londons_explore 6 hours ago
zulban 6 hours ago
> if I came up with this the first day then of course other millions of programmers did too.
Don't sell yourself short. I'm sure it was only hundreds of thousands.
amazingamazing 7 hours ago
Post a repro. I can do that too but then the similarity index is weak. The point is that it it looks indistinguishable then the integrity persists.
In my tests the image looks clearly distinct. In other words, if you can tell the difference then it isn’t a good test.
DonsDiscountGas 5 hours ago
Probably a lot easier to use a different model in the first place
dvngnt_ 5 hours ago
It will but many people won't as i've seen disinformation that could be detected by synth-id.
4ashz 5 hours ago
First they verify whether a picture came from OpenAI, then they'll include subscriber data and geolocation.
Well, they'll finally find out that no one wants to look at AI generated pictures or text. Once they do that, the tool will fail for the public and only work for the government.
Gigachad 3 hours ago
Seemingly the only use for photo realistic ai generation is deception. We are already seeing AI generated video used in political ads in America.
mpetrovich an hour ago
Seems inferior to C2PA, which is actually an open standard: https://contentauthenticity.org/
CSMastermind 8 hours ago
Aren't these kinds of watermarks easy to remove or distort? Seems like they're only helpful as long as people are relying on them sparingly so it's not worth the effort to circumvent.
If social media platforms started banning images with these watermarks seems like they'd be stripped out overnight.
amazingamazing 8 hours ago
No, they are very resistant to modification that can be done easily. That being said I doubt it is impossible
janalsncm 5 hours ago
Yeah cropping, color shifting, resizing and compression don’t remove it. That said, there’s pretty well known workarounds:
surgical_fire 3 hours ago
snissn 7 hours ago
I’m surprised! I guess I’m being naive but I would imagine you could pass an image to an image model without synthid and have it reconstruct the image in a net new way without the markers. I guess I’m wrong? That’s cool if the watermarks are so deeply ingrained that they persist
cephei 7 hours ago
Tiberium 8 hours ago
I still don't think there's a single GitHub repo that actually removes real SynthID watermarks from Nano Banana 2/NBPro outputs. Most of them are just some research projects that haven't achieved this. The only methods so far I've seen are weird tricks with transparency/overlaying the original image if you're using edits, and also using a diffusion model to regenerate the NB-generated image at low noise levels, but this also modifies the original.
vunderba 7 hours ago
Right I think that’s why you probably need to start with very low levels of denoising and experiment with different approaches.
Set up as a ComfyUI workflow that does a few things: it tries SDXL, Flux, and a couple of different denoising methods at the lowest possible strength (progressively incrementing) to avoid changing the image too much, while also running a SynthID check each time, and repeating this in a loop until the watermark is essentially gone.
At the same time, you’d probably want to add some kind of threshold based on a perceptual hash aka the maximum perceptual quality difference you’re willing to accept.
programd 7 hours ago
Define easily. There is an approach that apparently works and is based on spectral analysis of the images.
toraway 6 hours ago
FWIW there are a few people in the issues saying that the tool is giving false negatives and the output image gets flagged by the actual Gemini API as having SynthID. Most recently 3 weeks ago without a response.
Arnt 7 hours ago
This one was released a few years ago and still seems unbroken. I'm sure it will be broken at some point, but if you have to wait a year or two from when you make a deepfake until you can post it on Facebook, maybe that's enough. Maybe even a month is enough.
ZeWaka 6 hours ago
I imagine the technique of having AI recreate the image from scratch based on a very detailed description might work.
raincole 6 hours ago
That'd not work with today's technology. No open model's prompt adherence is anywhere remotely close to ChatGPT/NanoBanana. 'remotely' here is a funny understatement, as I don't have a strong enough word in my vocabulary to describe how far the open models are behind the closed ones.
Writing a more detailed description does not make the models stick to it more.
vunderba 6 hours ago
julianozen 7 hours ago
While these are great, isn’t the problem that malicious actors will create systems that do not use synthID
nerdsniper 7 hours ago
It helps significantly in the current moment. A lot of people are lazy and are getting caught quickly by SynthID.
Eventually it won’t matter when image generation is cheap. But few self-host today and few are willing to pay unsubsidized prices, so the vast majority are using the Gemini, OpenAI, and Midjourney. If all 3 adopted SynthID, only a small fraction would use something else.
echelon 6 hours ago
These systems should be removed.
This is antithetical to freedom and privacy.
There should be no way for anyone to track down who posted a political meme, anti-religious message, or any other legally protected speech. This will come back to bite us in the ass if we keep building it.
Soon every image or communication we make will be watermarked if we continue to let this shit seep into the commons. Everything from your phone photos, to your screenshots, to your social media posts.
One day soon Republicans or Democrats or whoever doesn't like your freedoms will use this tech to identify you and control you.
There are laws for harms - CSAM, revenge porn, etc. Social media platforms can identify, ban, and report abusers. The framework of the law can take care of the rest.
Our digital footprint should not be tracked and barcoded.
Barbing 2 hours ago
toraway 6 hours ago
croes 4 hours ago
sigbeta 41 minutes ago
I think this is a move by openai/google to prevent their own models from training on ai slop rather than some morally righteous public initiative.
rickcarlino 6 hours ago
What if they use advanced evasion techniques like printing it out and scanning it or taking a photo with their phone?
Retr0id 6 hours ago
SynthID is fairly resistant to this sort of thing, although not perfect.
cosmobiosis 2 hours ago
Well that's not very useful. I think that can easily be hacked and many people were doing that frankly
kube-system 8 hours ago
Is there no way to do this without uploading it?
woadwarrior01 7 hours ago
I'd built an on-device app for detecting C2PA and IPTC metadata in images, amongst other things. I might be able to add support for SynthID detection once it's been reverse engineered.
duskwuff 7 hours ago
Currently, there is not. OpenAI has promised "public verification tooling" down the line, but I'll believe it when I see it.
potsandpans 29 minutes ago
While this is definitely one of the topics of the moment. I find these threads really just ragebait magnets. A bunch of people effectively talking past one another: privacy vs preserving the status quo.
It's certain now that most of the Western world has slid into fascism. Privacy and common decency advocates are all but lost.
I will say this, for everyone celebrating this as something that is "extremely beneficial to the cultural moment",
If I were an adversarial nation-state actor, I might be extremely interested in reverse engineering this and poisoning the well by applying it to real images.
Let's make the world impossible to understand.
minimaxir 7 hours ago
I'm annoyed that Google is keeping it closed-sourced and limited to partners. Is there a negative externality about open-sourcing image watermark technology so anyone can use it and audit the watermarks independently? If not, then I may have a repository for an open-source invisible and tamper-resistant image watermarking approach that's feature complete...
thisisthenewme 7 hours ago
potentially to stop bad actors from poisoning datasets by just adding the filter to real pictures?
bsder 3 hours ago
The fact that they have to keep this closed source is a giant red flag. It means that you can copy it or strip it if you have the knowledge.
I'm not all that worried about stripping it (I'm sure that's trivial).
The problem that I am worried about is that it can be copied (I'd bet $20 that's trivial, too). People WILL put this on images so that they can be "discredited".
parhamn 7 hours ago
might be easier to strip it?
saberience 6 hours ago
What happens if you generate an image with only a single pixel color or say two colors?
akersten 5 hours ago
This was done in the past, Google saw it, and now either refuses to generate or doesn't emit the SynthID watermark for those images
userbinator 3 hours ago
You create an image so trivial that no one would care if it was AI-generated or not.
SiempreViernes 5 hours ago
You waste a lot of compute on overhead?
PunchyHamster 8 hours ago
so ? people wanting to make AI propaganda will just make tool to remove it. Possibly using AI to do it too
pta2002 7 hours ago
I assume a selfish benefit is that OpenAI and Google don't want the models to train on their own data. There is just /so much/ AI generated content online that they definitely need to filter it out somehow when assembling the training data. This is a pretty effective way to do that, with the nice bonus of being mostly good from a PR standpoint.
sgc 4 hours ago
I immediately thought that was the real reason. Their models will quickly break without some sort of consensus on how to reliably exclude them.