The dataset is a massive part of the AI ecosystem, used by Google and Stable Diffusion. The removal follows discoveries by Stanford researchers, who found thousands of instances of suspected child sexual abuse material in the dataset.
I’ve heard there are specific datasets you can download that contain the training data but not the images themselves. Someone else has already run the images through a trained model, and you’re just grabbing the processed output and plugging it into your own model. I’m sure I’m missing some nuance and haven’t looked into it myself, but I’ve seen that given as the answer when someone asked this before.
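To make that a bit more concrete, here is a minimal sketch of the "processed data instead of images" idea: you load vectors that someone else already produced by running the images through an encoder (e.g. CLIP), and your own training or retrieval code only ever touches those vectors. The file names, shapes, and the nearest-neighbor helper are hypothetical illustrations, not the actual release format of any particular dataset.

```python
import numpy as np

# Hypothetical files: one embedding row per image, plus the matching captions.
embeddings = np.load("image_embeddings.npy")   # shape: (n_samples, dim)
captions = open("captions.txt", encoding="utf-8").read().splitlines()
assert len(captions) == embeddings.shape[0]

def nearest_neighbors(query_vec: np.ndarray, k: int = 5) -> np.ndarray:
    """Return indices of the k embeddings most similar to the query (cosine)."""
    norms = np.linalg.norm(embeddings, axis=1) * np.linalg.norm(query_vec)
    scores = embeddings @ query_vec / np.clip(norms, 1e-12, None)
    return np.argsort(-scores)[:k]
```

The point is that nothing downstream needs the source images at all; the raw files never leave whoever computed the embeddings in the first place.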
IIRC from a previous thread, different law enforcement agencies will release hashes or something similar, so the images can be detected without anyone distributing the originals.
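Roughly, hash-list matching works like this sketch: you hash each file and check it against a list of known-bad hashes published by an agency, so flagging never requires redistributing the image itself. Real systems typically use perceptual hashes (e.g. PhotoDNA) rather than a plain cryptographic hash; SHA-256, the file path, and the example hash value below are stand-ins just to show the workflow.

```python
import hashlib
from pathlib import Path

# Entries would come from a published hash list; this value is a placeholder.
KNOWN_BAD_HASHES = {
    "3a7bd3e2360a3d29eea436fcfb7e44c735d117c42d1c1835420b6b9942dd4f1b",
}

def sha256_of_file(path: Path) -> str:
    """Stream the file through SHA-256 so large images aren't loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()

def is_flagged(path: Path) -> bool:
    """True if the file's hash appears in the published known-bad list."""
    return sha256_of_file(path) in KNOWN_BAD_HASHES

if __name__ == "__main__":
    print(is_flagged(Path("example.jpg")))
```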