Dear Google Mail Team

dasil003 · on July 17, 2015

I could see 20% false positives on spam for Linus equating to 0.1% of false positives across the board since I suspect the people emailing Linus are 200 times more likely to be running their own mail server than the general public.
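To spell out the arithmetic, here is a back-of-the-envelope sketch. Both inputs are my guesses from above, not measurements, and it assumes false positives scale linearly with how often your correspondents run their own mail server.

```python
# Both numbers are guesses from this comment, not data.
linus_false_positive_rate = 0.20   # guessed false-positive rate for Linus
self_hosted_multiplier = 200       # guessed: his correspondents self-host 200x more often

# Assumption: the false-positive rate scales linearly with the share of mail
# that comes from self-hosted servers.
general_rate = linus_false_positive_rate / self_hosted_multiplier
print(f"{general_rate:.1%}")  # prints 0.1%
```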

EStudley · on July 17, 2015

I agree completely. Wasn't there a thread here just a few days ago discussing Gmail consistently marking personal mail servers as spam?

EDIT: Found it. https://news.ycombinator.com/item?id=9855030

ChuckMcM · on July 17, 2015

Yes, Google seems to have flipped some switch or pushed some change that takes well established domains and senders from those domains and, for reasons that are not well understood, writes them off as spam. In theory they are getting a huge 'not spam' signal back at HQ but I agree with Linus that they screwed up big time. Stuff they should know wasn't spam and have in the past not classified as spam, now suddenly is. Algorithm update fail.

jarcane · on July 18, 2015

This started hitting several of my mailing lists a while back. Not just personal servers, but several major web mail providers, most notably Yahoo and AOL. Some theories floated around that it had to do with a change to header/certificates, but I don't recall if I saw any actual confirmation of that.

Too · on July 18, 2015

For some senders I don't even have the "report as spam" button, even though most of the shit I get from them is ads I didn't ask for. Just because it's from a well recognized company domain.

mordocai · on July 17, 2015

I use DKIM and have Reverse DNS + SPF and gmail likes my email just fine.

However, previous to setting all that up gmail would commonly mark my messages as spam. I haven't tested it in the last few days.

fweespeech · on July 17, 2015

And the moment someone accidentally clicks the SPAM button you'll find yourself with weeks of pain on a low volume mail server.

Because, as an individual, you won't qualify for their FBL service and "mysteriously" you'll have weeks of everyone saying you end up in their spam folder.

kyrra · on July 17, 2015

The Gmail team recently launched[0] PostMaster tools[1]. I'm not sure if everyone has access, but maybe it would help?

[0] http://gmailblog.blogspot.com/2015/07/the-mail-you-want-not-...

[1] https://postmaster.google.com/

nmjenkins · on July 17, 2015

You have to be a high volume sender with a good reputation (low spam reports) to get access to these tools.

scintill76 · on July 17, 2015

And they don't tell you whether you meet those criteria, until after you go to the trouble of logging in and serving a DNS TXT record for ownership verification, as I just found. Granted, I didn't expect to qualify, but it would have been nice if they'd told me up front.

blfr · on July 18, 2015

Yes, but at least they're using the same verification mechanism as elsewhere so if you have already added the domain to Google Webmaster for example under the same Google Account, it will be automatically verified.

massaman_yams · on July 18, 2015

The reputation requirements here are a very low bar; as far as I can tell it's effectively "don't be a botnet".

sdenton4 · on July 18, 2015

So they send you five million "I'm not a robot" recaptchas, and let you through if you only click one...

lucb1e · on July 18, 2015

Fun, but it doesn't scale. Google can only do this because they have a near-monopoly on email. What would you say if I gave you tools to whitelist yourself on my email server? You'd tell me to get my spamfilter straight, or more likely, simply ignore me.

I'm not against Gmail, just like I'm not against Outlook.com or Yahoo mail or something. It's just that providing tools only works for players in a power position (i.e. Google) who can afford to ignore small players (i.e. me), and what's more, this further strengthens their power position: the better they can detect spam, the more people will start using it (the postmaster tools are there to help people prove they are good, thus helping Gmail distinguish).

fweespeech · on July 18, 2015

I don't have access unless I use work contacts to push and I won't do that.

reitanqild · on July 18, 2015

I've seen otherwise intelligent people (not native English speakers but still) use mark as spam as an alternative to delete.

This was years ago but I guess that person was not alone.

fweespeech · on July 18, 2015

Yeah. It happens and when it does, it isn't them that is negatively impacted but you :P

Lunatic666 · on July 18, 2015

I had exactly the same setup, but I had to create 10 fake Gmail accounts, add the email address to the address book and flag several emails as "not spam" before it was useable. Google Mail just ignores the fact that there are private people who want to have their own mail server to be independent from Gmail.

bigiain · on July 18, 2015

<hat type="conspiracy">Not "ignoring", more "actively penalizing"…

voidz · on July 18, 2015

I'm actually inclined to agree. Google is just so hungry, they want two things: 1. your data; 2. try to take over the Internet. Bah.

cmdrfred · on July 17, 2015

I have all of that and it took about 2 weeks of people pulling my messages out of spam for it to work smoothly.

kbenson · on July 17, 2015

Were they set up initially, or after you noticed problems? I wonder if prior messages routed to the spam folder that people haven't marked as non-spam count against you for a certain prior period.

cmdrfred · on July 17, 2015

I added them after I noticed problems really. Postfix and Dovecot have somewhat of a learning curve (I started up and trashed a few VM's before I got it right). I ended up using IRedMail, the defaults are pretty much Gmail ready.

I don't know about prior messages counting against you, given what I've seen it seems to make sense. Without insider info we can only speculate.

kbenson · on July 17, 2015

Which is just as well, since even if you can set up a mail daemon, if you can't set up the rest of that, you have no place running a mail server.

davidgerard · on July 18, 2015

This hit us at work really badly: we use GApps, and everything from our internal list server (which we can't replace with GMail aliases because $REASONS) was being sent to spam ... and there was literally no way to whitelist this across the company automatically.

You know how Google treats its free customers with utter contempt? I can assure you, they treat paying customers with the same contempt.

OJFord · on July 18, 2015

Pretty ironic, then, that many suggestions in comments (on the source) are "you should just run your own mail server". Maybe two 'wrongs' do make a right..

michel-slm · on July 18, 2015

Even if said personal servers have DKIM and SPF set up? (Just thinking out loud here, I don't have data either way)

aidenn0 · on July 18, 2015

Yup, I saw a lot of people complain about this, despite having DKIM and SPF

thaumaturgy · on July 17, 2015

The hilarious -- or maybe really sickening -- thing is that Gmail itself has also become a much larger source of spam than it used to be.

Those of us that are running mail servers are really not loving Google at the moment.

urda · on July 17, 2015

It's what happens when you can control such a large portion of a resource. Just imagine, Google can at any moment blacklist just about anyone. Most websites and companies can do nothing to stop it.

It's a tad terrifying.

a3n · on July 18, 2015

I can see where it would be in Google's interest for people to throw up their hands and decide running their own is just not worth it.

madez · on July 18, 2015

That is why I don't need to be able to send from my server to Google. Accepting tyranny is just not worth it.

justwannasing · on July 17, 2015

Haven't a clue what you mean. I run several mail servers and have no issues whatsoever with GMail. And I run through all the spam filtering, too, and only recall one that was inadvertently marked as spam but, iirc, the mailer was the problem.

glass- · on July 18, 2015

I see spam from gmail all the time, and for about the last year they wouldn't accept reports from SpamCop (that changed a couple of weeks ago).

thaumaturgy · on July 18, 2015

I wrote a daemon that monitors a mailbox that users can submit spam to, for spam that makes it through the other filtering layers. The daemon finds the original message in the user's mail directory, reads the headers, tracks down the originating network using whois, and then blackholes the network and emails me a notification about it.

Some networks are whitelisted for practical purposes. Google's one of them.
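Roughly, the header-reading and whitelist steps look like the sketch below. This is a simplified illustration, not the actual daemon: the whitelist entry and the sample message are made up, the whois lookup and the blackholing itself are left out, and it assumes the topmost Received header with a bracketed IPv4 address was written by our own server.

```python
import ipaddress
import re
from email import message_from_string

# Hypothetical whitelist entry (a documentation range), not the real list.
WHITELISTED_NETWORKS = [ipaddress.ip_network("192.0.2.0/24")]

IPV4_IN_BRACKETS = re.compile(r"\[(\d{1,3}(?:\.\d{1,3}){3})\]")


def originating_ip(raw_message):
    """Return the IP from the topmost Received header that contains one, or None."""
    msg = message_from_string(raw_message)
    # Each hop prepends its header, so the first one is the most recent hop.
    for header in msg.get_all("Received") or []:
        match = IPV4_IN_BRACKETS.search(header)
        if match:
            return ipaddress.ip_address(match.group(1))
    return None


def should_blackhole(ip):
    """Blackhole unless the address falls inside a whitelisted network."""
    return not any(ip in network for network in WHITELISTED_NETWORKS)


# Made-up sample message using documentation addresses.
raw = (
    "Received: from mail.example.net (mail.example.net [203.0.113.7])\n"
    "\tby mx.example.org with ESMTP; Sat, 18 Jul 2015 10:00:00 +0000\n"
    "From: someone@example.net\n"
    "Subject: Get ranked higher\n"
    "\n"
    "body\n"
)
sender = originating_ip(raw)
print(sender, should_blackhole(sender))
```

Received headers further down the chain can be forged by the sender, which is why the sketch only trusts the most recent one.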

If necessary, I could provide a list of the subject lines of the emails that have been reported. A YC company, Zenefits, is one offender that comes immediately to mind.

But not until Monday. I'm going camping.

_delirium · on July 18, 2015

While we're waiting for Monday, here are some subjects of spam emails I've received from Gmail's SMTP servers recently. Mentions of my domains replaced by example.com:

   Example.com SEO Issues
   Tough Times With Example.com? Needs Attention!!?
   Get ranked higher: Example.com
   Google optimization for Example.com
   Get ranked higher : example.com
   Website Audit Report to increase website traffic
   Example.com - audit report now available
   Give a glamorous new look to your website
   Web Design Proposal example.com
   Organic SEO Promotion For Example.com

So looks like all SEO and web-design advertising.

voidz · on July 18, 2015

What a coincidence, that's Google's core business!

thaumaturgy · on July 20, 2015

Back. Here's a recent selection of subject lines, extracted from spam daemon replies. The "Zenefits + Problem" ones are funny .. my dba is "No Problem", and they're not smart enough to correctly handle that I guess, which makes sense, since they're also dumb enough to spam me. I've never replied to any of their messages.

    "Re: Zenefits + Problem"
    "Higher Targeted Traffic: Associatedtechs.Com"
    "Re:re: UL,CE,ETL Split-core Current Transformer(0.333V or 1A/5A output) ,Rogowski coil,hall AC-DC transducer"
    "Poor support processes could be costing you customers"
    "Get ranked higher:"
    "Higher Targeted Traffic: Associatedtechs.Com"
    "Re: Zenefits + Problem"
    "Re: Zenefits + Problem"
    "Your Website...!"
    "Get ranked higher: associatedtechs.com"
    "Google Update for: associatedtechs.com"
    "How to increase your website traffic and generating leads??"
    "Zenefits + Problem"
    "Mobile Apps Development"
    "Digital marketing proposal- www.associatedtechs.com"

This isn't a complete list, sadly, since it turns out the daemon isn't logging ban attempts against whitelisted networks. I should fix it so that it does.

austenallred · on July 17, 2015

I decided to do a quick check of my gmail; there were 151 emails in spam, and literally not one I would want in my inbox.

It seems entirely likely that Google is weighting whether something comes from a private mail server very heavily, and Linus, being who he is, gets a lot of email from private servers.

Not saying this isn't a problem, but it probably doesn't affect 99% of email users.

e12e · on July 18, 2015

> Not saying this isn't a problem, but it probably doesn't affect 99% of email users.

So, with a conservative one billion email users, only 10 million users see this problem.

wutbrodo · on July 18, 2015

This is pedantry, but 1 billion isn't all that conservative. GMail announced ~425M users in June 2012 and announced 900M a month and a half ago.

e12e · on July 18, 2015

Thanks for pointing out the real numbers. In my defence, I was thinking "conservative for the near future", so the next 5 years or so. And of course giving 9M users a bad experience isn't that great either.

austenallred · on July 19, 2015

If we're being pedantic, it's probably worth noting that the "99%" was a completely arbitrary assumption, and trying to run statistics based on one anecdote and an unfounded assumption is not going to produce accurate results.

e12e · on July 19, 2015

True. I suppose the main point is that with a billion users, even some "small fraction" of users having problems, is going to translate to a lot of people having problems.

Honestly, if Google took support even half-seriously (or: they considered eg: users of gmail their customers, rather than just their advertisers) -- these kind of issues wouldn't be so bad.

I do think it's just a question of time before Google relegates itself to irrelevance through a strictly inferior product though.

jimktrains2 · on July 18, 2015

> I decided to do a quick check of my gmail; there were 151 emails in spam, and literally not one I would want in my inbox.

I find about 2 a week that have been marked as Spam and shouldn't have been. Your experience is not everyone's (nor is mine).

wahnfrieden · on July 17, 2015

Just FYI, most spam does not even reach your spam folder.

nchelluri · on July 17, 2015

What happens to it? It gets blackholed somehow?

leonatan · on July 17, 2015

Mail relay servers deny its acceptance.

wahnfrieden · on July 18, 2015

Google can also decide it's not even worth showing in the spam folder - the spam folder is for things it's less certain of.

themartorana · on July 18, 2015

True. We send thousands of emails a week and all of a sudden a large portion weren't even delivered. Turns out it's all DMARC and once those policies were fixed (assigned at all) the delivery denials ended.

This helped us:

https://dmarc.postmarkapp.com
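For anyone who hasn't looked at one, a DMARC policy is just a DNS TXT record of semicolon-separated tags published at `_dmarc.<your domain>`. A minimal sketch of reading one is below; the record shown is a made-up example with a monitoring-only policy, not the one we actually published.

```python
def parse_dmarc_record(txt):
    """Split a DMARC TXT record into a dict of tag -> value."""
    tags = {}
    for part in txt.split(";"):
        if "=" in part:
            key, value = part.split("=", 1)
            tags[key.strip().lower()] = value.strip()
    return tags


# Hypothetical record; example.com and the report address are placeholders.
record = "v=DMARC1; p=none; rua=mailto:dmarc-reports@example.com"
policy = parse_dmarc_record(record)
print(policy["p"])  # the requested handling: none, quarantine, or reject
```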

brewdad · on July 18, 2015

Oddly, I have had the opposite problem. This past week I have had to delete 4 or 5 obvious V!@g&a!!!!!-type spam emails from my priority tab of my inbox. I can't remember the last time I had that happen. Something isn't working quite right.

jeffmould · on July 18, 2015

Have been seeing the same thing here as well. With spam messages coming through as normal mail.

I have also been having any messages that are replies to messages I send being diverted to my spam folder randomly. It is happening on both my personal gmail account and on a business account.

It seems lately Google has really been less focused on the core components that made them successful in the first place. I have found their search results seem to be returning more spam sites than before. My vote is to get Matt Cutts to come back and start cleaning up spam again :)

Bjartr · on July 18, 2015

I wonder if they rolled out some kind of new algorithm, or major algorithm change before it was ready for prime time.

pbreit · on July 17, 2015

"I would want in my inbox"

Unfortunately that's not the definition of spam.

baudehlo · on July 18, 2015

But that is his/her definition of spam. And as someone who has worked in the anti spam field, I'm ok with that.

However I have the same problems Linus has so I am not excusing gmail one bit in this.

pbreit · on July 19, 2015

But the people with a warped view of spam ruin it for everyone else.

detaro · on July 18, 2015

So by your generous estimate, it affects more than one million users. Sounds kinda bad.

knodi123 · on July 17, 2015

His point about marking one email in the middle of a thread is inexcusable.

I've seen that happen too. And only recently.

nnbvv · on July 18, 2015

I've seen it happen too (from my parents no less, who I have replied to 100s of times), but not recently.

Yahoo's spam filter also does this.

choppaface · on July 17, 2015

True, but the spam filter could certainly learn to weight private mail servers as Ham on a per-user basis. Perhaps the learning algorithm can't generalize the feature to new unseen private mail servers. However, Google certainly has the engineers who can add that functionality, they just need to properly manage them to get it done.

fweespeech · on July 17, 2015

Yeah, it's part of why I gave up on running my own mail server and signed up for Google Apps back when it was free. It's just not worth fighting the spam filters for side projects.

It was probably something along the lines of "Well consumers are what matters and they all use major services".

rodgerd · on July 17, 2015

> I suspect the people emailing Linus are 200 times more likely to be running their own mail server than the general public.

Assuming non-gmail servers are spammers is a pretty epic fail.

thaumasiotes · on July 17, 2015

Speaking as someone who is at least 200 times more likely than the general gmail-using public to receive mail from completely normal, mainstream Chinese email addresses... I'm still mad at gmail for just assuming mail from China must be spam. It's not spam!

(There's been improvement - for example, recently I received mail from someone I'd corresponded with in the past, and it wasn't initially marked as spam. Gmail used to be more aggressive than that, such that it would be marked as spam unless it was a direct reply to an email I had sent.)

(...for extra irony, that recent message was "I'm stuck in England and can't get home without a few thousand dollars". Her account had been hacked.)

dasil003 · on July 17, 2015

Value judgements aside, that's not what I'm postulating. It's more like "unknown" mail servers (meaning any mail server with low enough volume that Gmail doesn't have an opinion about it yet).

peeters · on July 17, 2015

What about using it as one of many criteria (maybe whether the email seems to contain gibberish being another, which patch files might be classified as).

bigiain · on July 18, 2015

Out of all the people in the world who regularly get mailed "gibberish" patchfiles, and not only fail to mark them as spam, but continue to interact regularly with the senders - do you not think it's reasonable to assume Gmail might notice Linus has been doing this for two or three decades?

I wonder what other forms of "gibberish" Gmail classifies as spam? GPG encrypted mail? Mail containing public ssl keys or CSRs? Any foreign language not regularly heard in Bro-ville, South Bay?

massaman_yams · on July 18, 2015

What about the possibility that 'vanity' email servers are more likely to have something non-obvious misconfigured? Maybe, for example, they send spam rejections back to the envelope from address (generating backscatter which looks like spam), rather than rejecting spam within the SMTP session?

TylerE · on July 17, 2015

FWIW, I just checked my mail from the last week, and had 4 false positives, out of ~80 mails. That's WAY higher than I've traditionally had with Gmail, where I might expect that many in a year.

snogglethorpe · on July 18, 2015

Yeah, whatever the problem here is, it's almost certainly very user-specific, depending on the mix of email they get, and Linus is probably somewhat atypical.

I saw his original post on G+, and of course immediately went and checked my gmail spam folder.... and... no false positives at all, 100% correctly identified spam.

caf · on July 18, 2015

The funny thing is that according to the comments, some mail from Google itself is being marked as spam.

georgemcbay · on July 18, 2015

Last year google paid me $1500 for finding a security flaw in Chrome. The mail they sent me with the ACH information was marked as spam/phishing:

http://i.imgur.com/Icfeh60.jpg

viraptor · on July 18, 2015

He said it mostly affected mailing list messages. His subscriptions come from a single server, so random spam classification of messages in the middle of some thread doesn't really make sense. Not as a sending-server issue anyway.

massaman_yams · on July 18, 2015

You're probably right. Mailing lists (of the discussion group variety, not marketing mail) often have problems with spam filter false positives, most commonly due to DMARC policies.

There's not really a great solution to that at the moment - either you technically violate RFCs by having your discussion group software modify some headers, or you deal with other kinds of breakage.

Doing header rewrites is effective for reducing FPs due to DMARC, but adoption is far from universal - off the top of my head I'm not even sure if Mailman supports that at the moment.

mike_hearn · on July 18, 2015

You only have to rewrite headers if your mailing list is actually modifying the mails i.e. doing a MITM attack on the mail flow. Some mailing list admins feel very strongly about footers, subject line tags etc and then claim they "must" rewrite the From header, but I am not sure it's technically required.

massaman_yams · on July 18, 2015

Discussion groups retransmit messages, which is enough to fail authentication in a lot of cases.

Here's an example: you have an address @google.com, which has a DMARC policy of 'quarantine'. You send a message from this address to a discussion group, which in the process, resends your message from a non-google server, thus failing DMARC.

Google's DMARC policy says that if an ISP receives a message from a @google.com From address and the message fails DMARC, that ISP should place the message in the spam folder.

So it boils down to: does a list operator change the From address in distribution group mail to use a list address they own in order to pass DMARC, or do you deal with the filtering consequences of failing DMARC for many domains?
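The underlying rule can be sketched in a few lines: DMARC passes when either SPF or DKIM passes for a domain that aligns with the From domain. This is a simplified illustration only. The `org_domain` helper is a naive stand-in (real implementations use the Public Suffix List), and `lists.example.org` is a hypothetical list domain.

```python
def org_domain(domain):
    # Naive stand-in: keep the last two labels. Real code would consult
    # the Public Suffix List instead.
    return ".".join(domain.lower().split(".")[-2:])


def dmarc_passes(from_domain, spf_result, spf_domain, dkim_result, dkim_domain):
    """DMARC passes if SPF or DKIM passes for a domain aligned with From."""
    target = org_domain(from_domain)
    spf_ok = spf_result == "pass" and org_domain(spf_domain) == target
    dkim_ok = dkim_result == "pass" and org_domain(dkim_domain) == target
    return spf_ok or dkim_ok


# Sent directly from the From domain's own servers: both checks pass and align.
direct = dmarc_passes("google.com", "pass", "google.com", "pass", "google.com")

# Resent by a list that altered the message: SPF passes only for the list's
# own domain, and the original DKIM signature no longer verifies.
resent = dmarc_passes("google.com", "pass", "lists.example.org", "fail", "google.com")

# Same list, but it rewrites From to an address it owns.
rewritten = dmarc_passes("lists.example.org", "pass", "lists.example.org", "fail", "google.com")

print(direct, resent, rewritten)  # True False True
```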

mike_hearn · on July 19, 2015

The whole point of DKIM is that messages can be relayed without breaking authentication, because it uses digital signatures instead of sending IP. So I think it wouldn't break.

massaman_yams · on July 22, 2015

... IF the body is not modified, and the header signature matches, AND headers retain DMARC alignment... the reality is that retransmittal (as opposed to just relaying) almost always does one or more of these.

Here's an example from a Google email engineer's recent post to the Mailop list, which is running Mailman software.

Authentication-Results: mx.google.com; spf=neutral (google.com: 2001:41c8:51:83:feff:ff:fe00:a0b is neither permitted nor denied by best guess record for domain of mailop-bounces@mailop.org) smtp.mail=mailop-bounces@mailop.org; dkim=neutral (body hash did not verify) header.i=@google.com; dmarc=fail (p=REJECT dis=NONE) header.from=google.com
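If you want to pull the verdicts out of a header like that programmatically, a rough sketch follows. It is not a full parser for the Authentication-Results grammar: it assumes parenthesised comments are not nested and contain no semicolons, which holds for this header.

```python
import re

# The header value quoted above, split across lines for readability.
header_value = (
    "mx.google.com; spf=neutral (google.com: 2001:41c8:51:83:feff:ff:fe00:a0b "
    "is neither permitted nor denied by best guess record for domain of "
    "mailop-bounces@mailop.org) smtp.mail=mailop-bounces@mailop.org; "
    "dkim=neutral (body hash did not verify) header.i=@google.com; "
    "dmarc=fail (p=REJECT dis=NONE) header.from=google.com"
)


def parse_auth_results(value):
    """Return a dict of method -> result, e.g. {'spf': 'neutral', ...}."""
    # Drop parenthesised comments (assumes no nesting).
    without_comments = re.sub(r"\([^()]*\)", "", value)
    parts = [part.strip() for part in without_comments.split(";")]
    results = {}
    for part in parts[1:]:  # parts[0] is the server that did the checking
        match = re.match(r"(\w+)=(\w+)", part)
        if match:
            results[match.group(1)] = match.group(2)
    return results


print(parse_auth_results(header_value))
# {'spf': 'neutral', 'dkim': 'neutral', 'dmarc': 'fail'}
```

Neither SPF nor DKIM produced a pass here, so there was nothing for DMARC to align against.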

mike_hearn · on July 24, 2015

That message says the body was modified. The solution is simple: don't do that. Your original message said DKIM breaks if you simply relay mail, but it isn't correct.

serve_yay · on July 17, 2015

I don't understand why you think that makes this acceptable.

dasil003 · on July 17, 2015

I don't understand why you think I think that makes this acceptable.

saurik · on July 18, 2015

The way you worded the sentence is classically used to indicate "well, this is an anomaly, because he is not a normal user, and is maybe even doing something absolutely abnormal, making this a trade off between the needs of the many vs the needs of the few, one he can easily bypass by doing something different inside his niche"; it took me reading a few of your later replies before I felt you were just providing an explanation for how Google's math could be flawed, as opposed to providing an explanation for why this might be an acceptable casualty.

dpweb · on July 18, 2015

I don't know about the spam filters but Torvalds' rants when he's onto something are awesome.

raverbashing · on July 17, 2015

Well, as he said, some emails in the middle of threads were marked as Spam.

Funny thing that happened to me, a mail from Google was marked as Spam. This was a long time ago, and it was from a mailing list, but apart from that, it was a legitimate mail.

mike_hearn · on July 18, 2015

This happens because of DMARC. It does actually make sense, in a way.

DMARC allows a domain to say "email that claims to come from my domain must be signed by me. If it doesn't, burn it with fire, no exceptions". So Gmail is only following the instructions laid out by the sending domain.

This is helping to make the email ecosystem a lot more robust by ending the problem of From forging. Ordinary users rarely realise that the From header is otherwise meaningless so phishing them can be very easy.

However it does not play well with mailman's default settings, and a lot of mailing list admins refuse to help the email ecosystem become more secure (whilst often PGP signing their own mails, doh). So DMARC creates a lot of noise in the technical community from people who have to/want to use mailman based lists.

halviti · on July 18, 2015

On top of that, how many of those thousands of threads are unique individuals.

I would venture a guess that many or most of them are coming from similar people, or at the very least similar domains, or even the same mail server.

alexott · on July 18, 2015

I got problems this week with mailing lists routed through apache servers & for which I had filters created, etc. And this happens not only for mailing lists

skrebbel · on July 17, 2015

yeah, things would be way better if everybody just used gmail!

billconan · on July 17, 2015

very funny, right after I saw this, I decided to check my spam folder just to see if anything important has been filtered out, and I saw an email sent by google marked as a spam:

this email is sent by google when logging in google account from a new machine. they tag their own email as spam ...

Hi xxx, Your Google Account xxx@gmail.com was just used to sign in from Chrome on Windows.

Don't recognize this activity? Review your recently used devices now.

Why are we sending this? We take security very seriously and we want to keep you in the loop on important actions in your account. We were unable to determine whether you have used this browser or device with your account before. This can happen when you sign in for the first time on a new computer, phone or browser, when you use your browser's incognito or private browsing mode or clear your cookies, or when somebody else is accessing your account.

Best,

The Google Accounts team

notahacker · on July 17, 2015

At least their filters are as impartial as they're inaccurate and overly aggressive...

Zancarius · on July 17, 2015

I wonder how that'd work in an anti-trust case?

"Your honor, clearly you can see that our services are so impartial as to flag our own email..."

Something tells me that probably wouldn't work.

undergrowth54 · on July 18, 2015

If yahoo sued them? I don't see why it wouldn't. It may not convince an expert witness, but why wouldn't it convince a jury?

Zancarius · on July 18, 2015

Yeah, good point. I guess I was feeling unnecessarily cynical.

If I were on the jury, I'd definitely buy it. Their counsel would have to really bone up the argument that it was such equal treatment, they were flagging their own messages. And the prosecution would have to really stretch things, probably entering into conspiratorial territory in order to make a case.

Then again, I've found myself perplexed by jury decisions on tech-related cases more than once. Although having sat on a jury, I can see how such decisions might be made.

tempestn · on July 18, 2015

Another great one is when you're having a conversation with someone, and after several back and forth emails, the next one goes to spam. Sent from the same device, same headers, etc. Presumably it triggered some keywords. But I mean, come on! Obviously if I've replied to this person four times in a row, their fifth email is not spam.

I realize that everything takes time to implement, and developer time is not infinite, but this one seems like pretty low-hanging fruit.

meridional · on July 17, 2015

Not long ago, I found in my spam mailbox several emails sent by Google's recruiters (from @google.com domain). I was, at the time, going through their interview process.

ThePowerOfDirge · on July 17, 2015

I lost a job application because the interview invitation ended up in my spam filter. Thanks, Google Accounts team!

cortesoft · on July 17, 2015

Haha, I did the exact same thing and found the exact same messages. I thought maybe they were fake, but looking into them they look legit.

Good job, Google!!

billconan · on July 17, 2015

Yes, my first reaction was also thinking those were indeed fake google emails. but I checked the logging in time and the from email address, they are legit (I got two emails like this filtered this week).

those emails were sent from no-reply@accounts.I.google.com

billconan · on July 17, 2015

wait, there is a line on top of that email saying:

Why is this message in Spam? It has a from address in accounts.l.google.com but has failed accounts.l.google.com's required tests for authentication.

what are required tests for authentication?

pki · on July 17, 2015

DKIM, SPF, etc?

mike_hearn · on July 18, 2015

That sounds like a bug on Google's end.

secabeen · on July 17, 2015

I run my own mail server with full SPF, DKIM and SRS, routing the mail through a relay at a reputable VPS provider on high-reputation IPs. Over the last few months, there seems to be this pattern where if I email someone @gmail who I've never mailed before, they don't seem to ever get it. I wonder if this is the issue.

jcborro · on July 17, 2015

Ditto, this. Some recipients find mails in the spam folder, others claim they aren't even in there, but have simply vanished. All were accepted for delivery by gsmtp.

sliverstorm · on July 17, 2015

I had a problem with my personal server for a little while; all my SPF, DKIM, etc was all set up for IPv4. I have a couple IPv6 interfaces on my machine, and my mailserver was delivering to GMail over IPv6!!
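The SPF half of that failure is easy to reproduce offline. The sketch below only looks at literal `ip4:` and `ip6:` mechanisms and ignores `include:`, `a`, `mx`, `redirect` and non-default qualifiers, all of which a real SPF check would have to evaluate. The records and addresses are made-up documentation values, not my actual setup.

```python
import ipaddress


def ip_listed_in_spf(spf_record, sender_ip):
    """True if sender_ip matches a literal ip4:/ip6: mechanism in the record."""
    ip = ipaddress.ip_address(sender_ip)
    for term in spf_record.split():
        term = term.lstrip("+")  # "+" is the default (pass) qualifier
        if term.startswith(("ip4:", "ip6:")):
            network = ipaddress.ip_network(term[4:], strict=False)
            if ip.version == network.version and ip in network:
                return True
    return False


# Hypothetical record that only covers the IPv4 address.
ipv4_only = "v=spf1 ip4:192.0.2.25 -all"
print(ip_listed_in_spf(ipv4_only, "192.0.2.25"))    # True
print(ip_listed_in_spf(ipv4_only, "2001:db8::25"))  # False: delivery over IPv6 is not covered

# Adding an ip6: mechanism covers the IPv6 interface as well.
dual_stack = "v=spf1 ip4:192.0.2.25 ip6:2001:db8::/64 -all"
print(ip_listed_in_spf(dual_stack, "2001:db8::25"))  # True
```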

ork · on July 18, 2015

Yeah, most of the time reverse DNS on the IPv6 address is not configured, leading to rejection.

DanBC · on July 17, 2015

Feel free to send me an email, and I'll let you know if it arrives in my gmail spam folder.

erikano · on July 18, 2015

I sent you an e-mail.

noir_lord · on July 17, 2015

This is one of the reasons I use Google Apps for Business for email (that and I find running mail servers tedious) as deliverability is consistently simpler.

svl · on July 17, 2015

Having to use a Google product instead of your own mail server, in order to have your email delivered to Google customers, sounds like anti-competitive business practices to me. You'd think they'd be a wee bit more careful with something like that given EU interest in them...

theflyingkiwi42 · on July 17, 2015

It also isn't enough. Some of our gmail users' email (sent through the gmail smtp server with oauth) will be marked as spam in gmail. Very frustrating!

npizzolato · on July 18, 2015

Your mail server isn't real competition to Gmail. Yahoo and Outlook probably are, but considering Gmail handles their mail fine, I'm not sure "anti-competitive" is the right angle here.

chc · on July 17, 2015

I doubt there are very many lawyers on the spam protection team. I also doubt the people on that team have any intention of harming minor-league competition. So it's not particularly surprising that they aren't more careful about avoiding something that a) is not well-known to them, and b) they don't think they are doing.

davidgerard · on July 18, 2015

It's just as fucked-up for our use of GApps, and customer service is just as absent for paying customers as free ones.

Microsoft could own business email again just by answering the phone.

danieltillett · on July 17, 2015

The problem with this apart from the cost is that all the other big email servers are more likely to mark your email as spam if it comes out of google apps :(

chc · on July 18, 2015

Do you actually mean Google Apps (i.e. Gmail) or are you thinking of Google App Engine? Because I don't think most other providers particularly distrust Gmail.

mordocai · on July 17, 2015

I also run like this and have had no problems. I don't commonly send emails though (mainly receive) so it is possible that I just haven't sent enough to see the issues.

chbrown · on July 17, 2015

I'm kind of surprised that Linus uses Gmail.

It's likely that he'll actually catch a Googler's attention, but for many of us, user feedback is not an option.

@jacquesm's http://jacquesmattheij.com/ham-or-spam-gmail-not-to-be-trust... is another recent instance — but again, there's no call to action.

Gmail is great for some people, but I prefer having more control, and I highly recommend https://FastMail.com if Gmail is failing to meet your needs.

RyJones · on July 17, 2015

Linux Foundation uses Google Apps for Work.

Source: I am an LF employee.

sebastianavina · on July 18, 2015

does the linux foundation have offices?

RyJones · on July 18, 2015

100% virtual. We have labs and a couple of the projects we support have offices, but no office like you're thinking.

sebastianavina · on July 18, 2015

why?

how do you get a job there?

RyJones · on July 18, 2015

why what? why no offices? keeps overhead lower, we like working from home, who is going to tell Linus to show up at an office every day?

glass- · on July 18, 2015

+1 for Fastmail. I really enjoy spam filtering based on personal bayes rather than a global database.

c5karl · on July 17, 2015

I've had a problem with false positives in my spam folder for months. A large percentage of the email newsletters I subscribe to end up in my spam folder every day, and clicking Notspam doesn't help. I can Notspam a certain newsletter every day for a week, and then the next day that same newsletter will end up in my Spam folder once again.

I'm starting to think that Notspam signals have no effect at an individual level. Either that or the button is simply a placebo.

Fortunately, the false positives for personal correspondence from individuals are still extremely rare, at least for me.

gerbal · on July 17, 2015

Yep, Gmail's spam filters work based on the collective judgment of Gmail users. The core of your and Linus' problem is that a lot of people use the 'mark as spam' button as an unsubscribe button for mailing lists.

userbinator · on July 17, 2015

> based on the collective judgment of Gmail users

I find this trend of "follow the majority" quite disturbing - it's as if they're implicitly saying that everyone should think the same way and punishing those who don't follow. What's spam to me may not be spam to you, and vice-versa.

Then again, having a personalised spam filter for each user would probably consume a huge amount of resources...

e12e · on July 18, 2015

Not sure why you are down-voted. Perhaps because everyone who runs their own mail generally runs individual filtering per account. Typically SpamAssassin will score an email, but filtering (based on that, and other criteria) is up to the individual user (e.g. by having a white-list, choosing which spam score to treat as spam, etc.).

As mentioned further up, some scoring works well for many users, but not for all, such as marking eg: Russian/Chinese/Not-spoken-here-by-most language as spam.

I really see no reason why Google should be as bad at classifying email as they apparently are.
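The setup described above (a shared scorer, with the final decision left to each account) can be sketched roughly as follows. This is only an illustration: `UserPrefs`, `is_spam`, and the example scores are hypothetical names and values, not SpamAssassin's actual API. The default of 5.0 mirrors SpamAssassin's usual default `required_score`.

```python
from dataclasses import dataclass, field


@dataclass
class UserPrefs:
    """Per-account settings applied on top of a shared spam score (hypothetical)."""
    threshold: float = 5.0                         # score at or above this counts as spam
    whitelist: set = field(default_factory=set)    # senders never treated as spam


def is_spam(sender: str, score: float, prefs: UserPrefs) -> bool:
    """Decide for one user, given a score that was computed once for everyone."""
    if sender.lower() in prefs.whitelist:
        return False
    return score >= prefs.threshold


# The same message, with the same score, is classified differently per user.
strict = UserPrefs(threshold=3.0)
lenient = UserPrefs(threshold=8.0, whitelist={"list@example.org"})

print(is_spam("news@example.com", 4.2, strict))    # True
print(is_spam("news@example.com", 4.2, lenient))   # False
print(is_spam("list@example.org", 9.5, lenient))   # False (whitelisted)
```

The scoring cost is paid once per message; only the cheap comparison is per user, which is why this kind of personalisation does not need a separate filter per account.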

massaman_yams · on July 18, 2015

That idea sounds compelling at first, but the data doesn't support it. There are plenty of email marketers who are focused on a non-technical audience (who presumably use 'mark as spam' to unsubscribe frequently) and which have no problems with spam folder placement.

There's a spectrum, and if a given sender looks considerably worse than average, they're more likely to get filtered.

If anything, if a newsletter is getting filtered, it's more likely to be the marketing manager's fault - perhaps they don't adequately monitor deliverability, or they don't test their content, or they don't use activity segmentation... etc.

pygy_ · on July 17, 2015

You can set up a filter to never mark the matching items as spam.

That's what I do for the Lua Mailing list.

c5karl · on July 17, 2015

Come to think of it, if they're going to have a "Never send it to Spam" flag in filter config, shouldn't it default to TRUE? If you're taking the trouble to set up a filter, it's probably because you are interested in those messages.

JoshTriplett · on July 18, 2015

Some mailing lists get spammed.

dottrap · on July 18, 2015

Wait, where is the "Never send it to Spam" option? I cannot find this option.

pygy_ · on July 20, 2015

I'm on mobile, so I can't send you a screenshot, but it should be in the list where you add tags, skip the inbox and so on, after creating a filter.

c5karl · on July 17, 2015

Sure, but I shouldn't have to do that. (That's a criticism of Google, not your good advice.)

massaman_yams · on July 18, 2015

Dragging the message to your 'primary' inbox sends a stronger signal than 'not spam'. Maybe worth a try.

click170 · on July 17, 2015

How easy is it to unsubscribe from those newsletters? Is there a one-click unsubscribe link that doesn't require you to login or enter anything before unsubscribing you?

If I've subscribed to a mailing list or newsletter and there isn't a one-click unsubscribe I'll click the Spam button to get it out of my inbox instead of going through their procedure.

One-Click Unsubscribe is paramount for mailing lists and newsletters not getting marked as Spam.

thrownaway2424 · on July 18, 2015

If your newsletter has a good reputation then clicking "mark as spam" in Gmail prompts the user to automatically unsubscribe instead of marking spam, or to do both. If your newsletter has marginal or bad reputation or does not offer automatic unsubscription then that doesn't appear.
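For senders wondering what "offering automatic unsubscription" looks like in practice: mail clients generally rely on the message carrying a `List-Unsubscribe` header (RFC 2369). Below is a minimal sketch using Python's standard `email` library. The addresses, the URL, and the `build_newsletter` helper are placeholders for illustration, and whether a provider actually shows the prompt also depends on sender reputation, as described above.

```python
from email.message import EmailMessage


def build_newsletter(to_addr: str, body: str) -> EmailMessage:
    """Build a newsletter message that carries a List-Unsubscribe header."""
    msg = EmailMessage()
    msg["From"] = "news@example.com"      # placeholder sender
    msg["To"] = to_addr
    msg["Subject"] = "Weekly newsletter"
    # RFC 2369: one or more angle-bracketed URIs, separated by commas.
    msg["List-Unsubscribe"] = (
        "<mailto:unsubscribe@example.com?subject=unsubscribe>, "
        "<https://example.com/unsubscribe?token=PLACEHOLDER>"
    )
    msg.set_content(body)
    return msg


msg = build_newsletter("reader@example.org", "Hello!")
print(msg["List-Unsubscribe"])
```

Offering both a `mailto:` and an `https:` URI lets the receiving client pick whichever mechanism it supports.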

danielweber · on July 18, 2015

Sometimes I don't want to deal with the people any more.

This may be an anxiety issue on my part. Fair enough. I will generally continue to do it.

LukaAl · on July 17, 2015

Happened to me as well. Some of the emails were just "update" emails that I like to receive, but it's not a big deal if they get lost. But a couple of them were very important ones and, to make things worse, they were replies to emails on which I was CC'd. So, a colleague of mine sends an email to someone and CCs me. The second person answers, and that mail is marked as spam for me but not for the person who wrote the original email. Doesn't it make sense that an answer to a legitimate conversation is by default a legitimate email?

e12e · on July 18, 2015

Thankfully there's Hangouts, so we can all move off email... /s

cybojanek · on July 17, 2015

How much of this is caused by people marking mailing list emails as SPAM instead of properly unsubscribing?

therealarmen · on July 17, 2015

I don't blame them when the Unsubscribe link is buried at the bottom of an 8MB marketing newsletter in 4px font.

pd1 · on July 17, 2015

Or when you need to log in again to unsubscribe

eCa · on July 17, 2015

Which is illegal in the US[1] (and I believe EU, as well), and hence deserves to be reported as spam.

[1] https://news.ycombinator.com/item?id=4496688

TylerE · on July 17, 2015

A big problem I've had is that many of these are "business relationship/transactional" e-mails, which play by different rules.

My address is my first name + last initial (neither of which are all that uncommon), and this is made much worse by Gmail's idiotic ignoring of periods in addresses. There is a dude in Denver, CO who is absolutely convinced his e-mail address is tyler.e@gmail.com. It isn't. I'm really sick of getting his AT&T and car insurance e-mails.

tizzdogg · on July 18, 2015

This happens to me all the time as well. I also have firstname.lastname@gmail.com and apparently a lot of other people seem to think they do as well. Or at least, they have firstname.lastname1@gmail.com and people easily forget to add the number.

I wish there was a better way to deal with this type of situation other than constantly sending "please fix your address book" emails. Email is a broken system.

e12e · on July 18, 2015

I'm not sure how you go from: Unless you have Google Apps for Business (or whatever), there are no vanity domains for gmail; to: email is a broken system?

Gmail.com is certainly broken in the sense that they want to cram 10 billion users into a single domain. It's a ridiculous marketing/brand-motivated UX failure.

Since forever most mail services had a few vanity-domains, so people could get first.last@wherever.com. But no, Google doesn't want to provide email, they want to provide "Google Mail".

Apologies for the rant, but I can't stand it when big companies create problems through stupidity.

tizzdogg · on July 18, 2015

I don't believe I said anything about vanity domains, so I don't know what you mean by that.

I meant that email is broken in the sense that when some stranger mistakenly thinks that your email belongs to them, and continues to give it out or sign it up for mailing lists, you have absolutely no recourse. If you have an email address that, like mine, is easily mistaken for other ones, you get incorrectly-addressed personal emails many times a day. There is no way to find the actual intended recipient or get in contact with that person to say "hey you seem to be confused, stop using my email address". And there is no really good way to filter those emails, since after all they are coming to your correct address. I think this is the kind of problem that's difficult to appreciate unless it happens to you frequently.

The problem with email is that anyone can email you if they have your address... that's why we have so much spam. I don't know what the solution is, but it would be much better if the recipient had to opt in to the conversation somehow as well.

dingaling · on July 18, 2015

> so I dont know what you mean by that.

I believe he means that if tizz.dogg@gmail.com was already taken, Google should offer tizz.dogg@loopyloop.com rather than tizz.dogg1@gmail.com. In fact they shouldn't even show that as an option.

Since Google is now a domain registrar they could create the new domains on the fly.

Then there wouldn't be namespace collisions.

e12e · on July 18, 2015

Indeed. Google/Gmail do two strange things: a) While they support the age-old username+whatever@gmail.com in order for users to hand out special addresses to mailing lists etc. (e.g. user+facebook@gmail.com, user+lkml@gmail.com) -- they don't distinguish on dots in the username (so username == user.name == u.ser.name etc.). And b) they don't offer domains other than gmail.com, which leads to strange things like smith1, smith79 etc.

As for there being "no recourse" -- apart from spam, that's just wrong. It's much faster to reply with a "This is not your Smith"-mail, than it is to write a "return to sender" on an envelope. Same thing for getting phone calls from a different timezone etc.

[ed: I do agree that it's a bit more difficult with people that don't know their own address -- still think it should be quicker to reach their contacts via email than via comparable means.]
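The dot and plus handling described in (a) can be sketched as a normalization function. This follows the rules as stated in this thread (dots ignored, anything after `+` ignored) and is not Google's actual implementation; `canonical_gmail` is a hypothetical name.

```python
def canonical_gmail(address: str) -> str:
    """Collapse a gmail.com address to the mailbox it is delivered to,
    following the rules described above: drop any "+tag" and ignore dots."""
    local, _, domain = address.strip().lower().rpartition("@")
    if domain != "gmail.com":
        return f"{local}@{domain}"     # the rules are only applied to gmail.com here
    local = local.split("+", 1)[0]     # username+lkml -> username
    local = local.replace(".", "")     # user.name -> username
    return f"{local}@{domain}"


for addr in ["user.name@gmail.com", "u.ser.name@gmail.com", "username+lkml@gmail.com"]:
    print(addr, "->", canonical_gmail(addr))
# all three map to username@gmail.com
```

This is also why a filter on one "wrongly dotted" variant only helps so much: every dotted variant lands in the same mailbox.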

tizzdogg · on July 18, 2015

>> It's much faster to reply with a "This is not your Smith"-mail

Right, and I've sent literally hundreds of those emails. They almost never do any good, because while one person may fix your address in their contacts list, the original person who gave out the faulty address is still out there, unaware that they're giving out bad info. I always ask if the email sender can tell the intended recipient about this when they figure out the right address, but that rarely works. Anyway, I know this is a very specific problem that only affects a small fraction of people, but it's extremely annoying.

I don't really see how allowing other domains would help... that just shifts the issue to the domain string instead of the user string. I guess it gives people more options. But one of the main benefits of gmail addresses is that they're so common. Everybody knows it, so nobody ever misspells the 'gmail' part at least.

pd1 · on July 22, 2015

But isn't this true about phone numbers, addresses, and other things that have been around for longer than email?

brewdad · on July 18, 2015

I have a rather uncommon first.last combination. Still, there is a woman in a flyover state married to a guy with my same name who seems to think my gmail address is his. I am building up quite the profile of their family. Thanks to Gmail I know where he works, what car she drives, where they went to college and where they fill prescriptions. Fortunately for them, I have no desire to use any of this information.

It is annoying however especially since most of the spam in my spam folder is addressed to her through my email address.

detaro · on July 18, 2015

Can't you just set a filter on the wrongly dotted address? Not something you should have to do, but...

TylerE · on July 18, 2015

I suppose I could, but I've seen MANY variations, and tbh there are enough mis-sends to tylere@gmail.com that it wouldn't really help.

TBH I hardly use personal e-mail these days, it's basically a bucket that receipts and confirmations get dumped into, in which case search works well enough. Most actual conversation is done via Facebook or IM, etc.

Bill_Dimm · on July 17, 2015

Which is presumably done so that if you forward the newsletter to someone else they can't (accidentally or maliciously) unsubscribe you by clicking the link.

click170 · on July 17, 2015

When it's trivial to re-subscribe if you want to, I'd prefer that they do include a one-click unsubscribe. I'll deal with anyone who maliciously unsubscribes me from a mailing list that I want to be subscribed to, and this will highlight who my "good" friends are anyway.

thaumaturgy · on July 17, 2015

I'd assume cybojanek is talking about majordomo-like mailing lists, the sort used by lots of open source projects, which typically require an email sent to a specific address with "unsubscribe" in the subject line and so on and so forth.

click170 · on July 17, 2015

Curiously, having to email <mailinglist>-unsubscribe@server.com to unsubscribe from a mailing list is, to me, preferable to having to log in to a website in order to unsubscribe from the same mailing list.

I think it's because I'm beginning to resent the idea of every website and its dog requiring that I have a user account before I'm allowed to even browse the content.

thaumaturgy · on July 17, 2015

You're not alone. I much prefer majordomo to any other web-based mailing list system. I just chalked it up to a greybeard quirk; HN at large seems to be the opposite.

notahacker · on July 17, 2015

Relatively little. Gmail's filtering system just seems to be increasingly erratic, so one Meetup group email ends up in the Spam filter whilst another appears with high importance. If anything, the problem appears to be the opposite: de-emphasising the sending organisation.

lilyball · on July 17, 2015

How much of this is caused by marketing mailing lists sending messages to people who never subscribed?

I know I get a fair amount of unsolicited marketing list messages that do have an Unsubscribe link, which I click, but I also mark as spam because I never subscribed to it in the first place (of course, I'm also not using Google Mail, I'm using FastMail with a personalized SpamAssassin filter, but I assume it will still influence the global default SpamAssassin filter).

Tinyyy · on July 18, 2015

Well if I didn’t explicitly sign up for the mailing list, I’m just going to report it as spam.

imh · on July 18, 2015

Just because there's an unsubscribe button doesn't mean it's not spam.

Too · on July 18, 2015

Sounds like there should be another choice after you report something as spam: Spam - ads I didn't ask for, Spam - fraud attempt, Spam - I unsubscribed 5 times but still get updates, Spam - my ex keeps stalking me, etc.

incepted · on July 18, 2015

After reading this, I went through my spam folder and it's looking overall quite okay EXCEPT that all the comments on Google+ in response to things I posted these past few weeks are marked as spam. All of them.

Yup: Gmail is marking comments originating from Google+ and written by legitimate users as spam.

tortilla · on July 17, 2015

Wow, just checked my spam folder and there was an important email marked as spam by Gmail. It was from a known contact I had already corresponded with.

netheril96 · on July 18, 2015

Every spam filter has false positives. You should make a habit of checking your Spam folder periodically, whatever email service you use (unless you do not enable spam filtering).

tempestn · on July 18, 2015

What would be really fantastic is if Google let you set your own spam threshold. I can't even imagine it would be too difficult. Presumably they determine a numeric 'spam likelihood' number for each incoming email already, so this would just mean being able to customize the threshold that that number is compared against. Obviously entering it numerically wouldn't be very user friendly, but even 5 levels from most to least aggressive (like SpamAssassin and such offer) would be extremely helpful.

Even better would be if you could have different handling for different levels, like black-holing or auto-trashing the absolutely-definitely spam, making it easier to occasionally scan the regular spam box. I get something over 1000 spam emails per day, so it's just not feasible to give even a cursory look over them to find the false positives. I can't even imagine what it would be like for someone like Linus.

Unfortunately, that would draw attention to the fact that the spam filter isn't perfect, and would require users to make choices with tradeoffs, so I can't imagine it's a very attractive option for Google.
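A rough sketch of the idea: map a handful of aggressiveness levels to score thresholds and route each message into a tier. Everything here is hypothetical, including the level numbers, the threshold values, and the assumption that a per-message spam likelihood between 0 and 1 is available. It says nothing about how Gmail scores mail internally.

```python
from enum import Enum


class Action(Enum):
    INBOX = "inbox"
    SPAM_FOLDER = "spam folder"
    DISCARD = "discard"


# Hypothetical levels, 1 (least aggressive) to 5 (most aggressive).
# Each maps to (score needed for the spam folder, score needed to discard outright).
LEVELS = {
    1: (0.95, 0.999),
    2: (0.90, 0.995),
    3: (0.80, 0.99),
    4: (0.65, 0.98),
    5: (0.50, 0.95),
}


def route(score: float, level: int = 3) -> Action:
    """Pick a destination for a message given its spam likelihood and the user's level."""
    spam_at, discard_at = LEVELS[level]
    if score >= discard_at:
        return Action.DISCARD        # "absolutely-definitely spam"
    if score >= spam_at:
        return Action.SPAM_FOLDER    # worth an occasional scan for false positives
    return Action.INBOX


print(route(0.85, level=1))    # Action.INBOX
print(route(0.85, level=3))    # Action.SPAM_FOLDER
print(route(0.999, level=3))   # Action.DISCARD
```

Splitting off a discard tier is what would make the remaining spam folder small enough to skim.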

scrollaway · on July 17, 2015

I am subscribed to the wine-devel, wine-bugs and wine-patches mailing lists (https://www.winehq.org/forums). Having the exact same issue.

It seems to very easily flag discussion about .exes as spam, it's really disappointing. It's been several years and the filters haven't improved, despite me religiously flagging spam/not spam in those lists.

In the end I just gave up and set up filters to specifically prevent marking incoming emails on those lists as spam. It misses the odd linkedin invitation, but it's not like it was catching it before...

milspec · on July 18, 2015

You are "flagging spam/not spam in those lists" but Google may then associate that list (not the content) as being spam or not. Google sees that mail comes from the list and decides based on that.

This is why I never mark mailing list email as spam, even if it is in fact spam.

scrollaway · on July 18, 2015

That's my point. Everything that went to spam, I flagged as "not spam".

milspec · on July 18, 2015

Yes, but did you ever mark mailing list email as spam? (actual spam even)

If so, maybe gmail thinks "this sender is spammy". That's the mailing list.

scrollaway · on July 19, 2015

I don't think I did; but if I did, I'd argue gmail should know better.

noinsight · on July 17, 2015

He already got a response from the Gmail product manager, must be nice being Linus.

Notice the comments from Sri Somanchi. He's listed as the product manager here: http://gmailblog.blogspot.com/2015/07/the-mail-you-want-not-...

mkhpalm · on July 18, 2015

The unfortunate thing about gmail is it's gotten worse than AOL was to deal with on the spam front. I feel like there are historical lessons worth learning regarding where all those @aol.com users went.

1. Everything became "spam"

2. They got to a point where they believed they were the standard

3. Nobody could do anything about it

I can think of many places where this same situation has played out. It has yet to work long term without disastrous results after a reign of technical darkness. That doesn't seem to stop people from thinking it won't happen to them.

It's fine to aggressively fight spam. If you choose to err on the side of false positives then it's in your own interest to provide reasonable recourse. If not, you've left a very large gap that somebody else will come in and fill. Just as Google did.

elevensies · on July 17, 2015

It might not be related, but I've been seeing some spam in my gmail inbox in the past month. It seems that something has upset the balance. For example, this went to my inbox:

From: [...] Baby <[...]baby@gmail.com>

Subject: HELLO HANDSOME

Body: HOW ARE YOU DOING

alexdmiller · on July 17, 2015

Interesting, I've been getting very similar spam messages slip into my gmail inbox.

hauget · on July 17, 2015

Likewise.

TillE · on July 17, 2015

Same. After years of almost zero spam, recently I've been getting stuff like that about once a week in my primary inbox. Very annoying when my phone dings only to show me obvious spam.

realusername · on July 17, 2015

I have the same problem. Before, these emails would go into the SPAM folder, but now even some obvious "YOU HAVE WON THE LOTTERY!" emails are arriving directly in the inbox.

frik · on July 18, 2015

Have you checked the mail's raw text? Maybe it's just disguised as gmail but sent from a third-party server. Or spammers found a way to spam from within gmail, or hacked gmail. Both have happened at least once.

qmalxp · on July 18, 2015

A few months ago, I wrote a simple Android app and put it on the Play Store. Now every two weeks I receive unsolicited spammy emails about ad campaigns and increasing user awareness. Funny how those get through the filter.

throwaway2048 · on July 18, 2015

sort of doubly ironic that many other Google service emails that you actually want to receive get filtered...

blfr · on July 18, 2015

They also started delaying or outright rejecting some mail more aggressively so you don't even get to find it in spam. A few days ago I received a confirmation code from my CA sent to hostmaster@ the next morning after I requested it.

What's even worse, they rejected email to postmaster@. I know you can adjust the spam filter sensitivity somewhere in Google Apps but come on, you should not reject any mail to postmaster by default.

lucb1e · on July 18, 2015

The real shocker is that Linus Torvalds uses gmail, where you have no control over anything regarding your account (exhibit A: look, an awesome new spam filter which you can't turn off!). I would never have thought he'd do that.

becausecomputer · on July 18, 2015

No. Google Apps for Work.

lucb1e · on July 18, 2015

Well same thing, isn't it? Clearly he is forced to use a spamfilter with no opt-in or opt-out.

jbit · on July 19, 2015

Google Apps has a few more knobs at least: https://support.google.com/a/answer/2368132?hl=en

balls187 · on July 17, 2015

"Check your spam folder" is now a default instruction for automated email notifications.

Luckily it's pretty easy to scan the folder for valuable messages.

However, having to do that is clearly not ideal.

Had a wedding RSVP get flagged as spam.

jrapdx3 · on July 18, 2015

FWIW after reading this article, and comments here, I decided to check on the gmail account of a small not-for-profit organization I belong to (I'm the unofficial IT guy). I was shocked, there were 4127 spam messages, and just 98 unread items in the inbox. Slogging through the spam I did find some non-spam mail, but altogether that was <1% of the 4127 spam items.

Of course gmail deletes spam more than 30 days old, so how does it happen that an obscure educational non-profit gets over 4000 spam messages a month? Gmail must be a huge spam magnet, but it's still a mystery how those messages find their way into this spam bucket. (Unless in the past somebody had abused the account and the email address is on a thousand spammers' lists...)

In any case it's hard to be certain what criteria the spam filter uses to declare a piece to be "spam". Not all the misclassified emails were sent from "private" servers; it would be useful if the criteria were more clearly specified.

bliker · on July 18, 2015

I’ve noticed the same thing. Do you by any chance have a catch-all address? That was the major source of spam in the inbox for me.

barrkel · on July 17, 2015

Mailing lists are a massive source of false positives for spam. I've pretty much given up on trying to use gmail to subscribe to them.

click170 · on July 18, 2015

Subscribing to high volume mailing lists from gmail is something I would actually advise against.

I had legitimate emails bouncing because the mailing list had put me over the maximum number of emails that a free Google account can receive in a day. I didn't even know there was such a limit until I hit it.

I now run my own mail server and have none of these or the other problems outlined here.

rn222 · on July 17, 2015

Google Mail Team, please test all future spam algorithm changes against Linus' inbox.

rcarmo · on July 18, 2015

I've been on the other end of this for a few days - basically any company I do business with who has their e-mail on Google simply doesn't get my e-mails. Sometimes I get an active bounce (i.e., a reject due to my originating address), sometimes... Nothing. No pattern, either. Same destination, different behaviors.

The mail simply does not reach my suppliers'/partners' inboxes, and as a result we're all losing time and patience with this.

The really funny thing? Some of those people I work with are @google.com.

(And yeah, my corporate domain is clean, SPF'd up to the wazoo, etc.)

Jemaclus · on July 18, 2015

I've been actively interviewing for the past few weeks, and I've noticed that a very large number of emails from companies and recruiters (mostly recruiters) have been marked as Spam or at least shunted off to a non-Inbox folder. I don't have any specific custom filters in place, so this is 100% Gmail's doing. I find that interesting and frustrating -- and none of these companies/recruiters have ever seen this before! Seems like a relatively new phenomenon, at least for these people.

I wonder what Google's internal metrics show...

junto · on July 18, 2015

This is mainly due to the fact that so many recruiters have poor practices. I have a rule that if I'm contacted by a recruiter (unsolicited) I politely ask them to remove my details from their database. I label their email as unsubscribed and archive the email in Gmail.

When I get a second email from a recruiter that previously has been marked as unsubscribed then I click that 'mark as spam' button.

I've been doing this for eight years, since I'm quite happy with my work situation and have actively tried to remove myself from their databases over that time. However my CV still seems to be floating around, even though I've also deleted it from every online job portal I was signed up to.

At some point most recruiters need to feel that pain, because they just don't listen.

jimktrains2 · on July 18, 2015

> recruiters have been marked as Spam

As they should be, because they are. They have terrible practices and I have never met one I would want to do business with.

jpindar · on July 18, 2015

I get quite a lot of spam that is meant to look like it's from a recruiter, but is clearly sent at random.

datashovel · on July 18, 2015

I think even Google should fear the prospect of Linus Torvalds on a mission to "fix email".