Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I wonder how easily you could reproduce the system you created (how much do you remember? what sorts of resources did it require?) - and how where you worked would react/respond if you did so.

If it really did catch millions of emails successfully then such a system would be highly appreciated by the majority of the market.

It's not your fault if where you worked didn't see the value in what you'd created.



I think there aren't many startups/smaller companies trying to tackle the problem because: 1) there's not much money in it. Email is supposed to be "free", so is security. 2) mail systems, esp at larger/older orgs, tend be extremely baroque and you'd need full time integration engineers before making a penny. 3) Privacy laws make things difficult. 4) You need data and a wide view of the email landscape, and you only get that at large enterprises, and 5) at those orgs, the mail server operators are just waiting for the day they can retire and leave behind the mess they made.


We are working on this problem and hiring. Send an email to postmaster@protonmail.ch :)


Thanks for replying!

> I think there aren't many startups/smaller companies trying to tackle the problem

I'd agree there too - in those two specific cases.

> because: 1) there's not much money in it. Email is supposed to be "free", so is security.

Right.

> 2) mail systems, esp at larger/older orgs, tend be extremely baroque and you'd need full time integration engineers before making a penny.

No... that sounds intimidating, daunting and disillusioning.

> 3) Privacy laws make things difficult.

One word: Gmail.

See next point.

> 4) You need data and a wide view of the email landscape, and you only get that at large enterprises, and

Also one word: Gmail.

They've set what amounts to revolutionary precedence on perception of privacy and necessary ubiquitous data access.

> 5) at those orgs, the mail server operators are just waiting for the day they can retire and leave behind the mess they made.

See answer for (2).

Two alternative providers immediately come to mind: fastmail and protonmail. (Literally, these are what I remember right now.) The first has a large enough userbase they have interesting problems. The other is security-focused, and probably has adequate raw data to usefully tune an NN.

While poking around for fastmail stories I was sure I'd seen on here (it might've been another provider, but I think it was fastmail), dropbox caught my eye - they would of course need (and have) similar systems to this sort of thing too. This reminded me of sendgrid, and every other email mass-delivery provider that doesn't want to turn into a spam-farm.

Some extra consideration made me think of another idea, though.

I have a small trickle of false positives that land in my gmail spam folder (and get deleted) on a regular basis. This self-perpetuating cycle exists because I simply don't keep on top of what's in there and "not spam" everything that is technically not spam. So, my lack of interception is treated (passive-aggressively?) as agreement, and it happily chews away deleting all my old (almost-literally-spam-but-still) newsletter subscriptions and whatnot. The occasional activation email gets buried in there as well.

The key part here is that I don't feel like keeping on top of the spam thing, which is because I don't really _value_ what's in there. I trust my $mail_provider's spam system way too much to "do the right thing" (the provider is agnostic here) - and I don't mean doing the right thing morally, I mean it in the sense of "do what I mean". Which it doesn't, even though I wish it did.

I wonder how many other people feel the same [ambivalent] way about spam... and wouldn't mind simply just shoveling _everything_ in their spam folder to a "verification" service? The idea would be, you have a custom client that connects to and only reads from the spam folder, and then it un-trashes things it decides are not spam.

Gmail seems to provide API integrations to fetch only spam (pre-filled fields: https://developers.google.com/gmail/api/v1/reference/users/m... - I tried selecting for the "SPAM" and "TRASH" tags, but it seems to want a query of "in:spam" instead) and then trash/untrash messages (see bottom of opened node in list at left of that page). Naturally you can do this via POP/IMAP too but then it really is your word against your code as to what you're accessing (the query above currently only returns 15 results for me, which I just checked and is as correct).

As per https://developers.google.com/gmail/api/v1/reference/quota you can perform a cumulative total of 200,000,000 list+trash+untrash operations a say; all of them use 5 of the 1 billion quota units available daily. The main concern if you scaled sufficiently that you approached this limit is that asking for higher quota might provoke a "yes" in the form of a purchase offer :P which would have both pros and cons (stares really hard at the second word).

But besides Gmail, verification is kind of cross-platform and cross-vendor - and anything with scoped-list and untrash APIs could be integrated into this... in theory. I trust your mentioning of in-house mail being baroque, and I'm curious to hear how terrible this is in practice.

Perhaps you could have a cool UI that lets power users describe why the email is not spam - break the ranking metrics out into the UI in some way and let people provide feedback in useful (machine-readable!) terms that can (properly aggregated) be directly folded back into the network to train it.

I know Gmail has a plugin/extension API now, although I don't know how useful it would be to implement a UI for this sort of thing.

The hacky high-maintenance route would be Chrome/FF extensions.

Of course neither would work on mobile; to create a smooth experience there you'd basically have to implement a from-scratch email client with the custom spam views baked in.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: