Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

In theory, yes, in practice it sounds like it could be a very expensive content filter. Then again, a simplified and cheap / optimized single purpose model would work. Given how creative people can get and how many different languages etc there are, it'd be interesting to see. But it would need to be a model that not only knows all the bad words and intents, but also learns the workarounds, like how Roblox users have invented creative phrases like "go commit die", "yeetus yeetus commit self deletus", or "go commit cease vital functions necessary for the prolonging of one's physical being".


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: