I don’t think they’re saying that method would yield 100% clean data, but it would give you all the “necessary” data with the absolute bare minimum storage requirement. At some point people will log into their email, and for most people, if you have their email password, you have the password they use for everything.
I think you misunderstand what 12ft.io’s business model is: they didn’t bend over for the NYT; the NYT bent over for 12ft.io by paying to be excluded from the service. Literally the whole point is to extort money from publications that want to get on the whitelist.