A persona is a fictitious person that has certain defined attributes. In product marketing, we create personas for major groups of users who will use the product. For example, a word processor’s set of personas might include a high school student, a university student, a generic office worker, a specialist author, a manager, and so on.
In the world of black-hat SEO or spamming, a persona is usually a very shallow person, with no thought given to its creation. Beyond a rather implausible Western name and Gmail address, there is no sophistication, perhaps because the only purpose of that persona is to send once-off spam. Since you can create billions of fake Gmail/Yahoo/Rediffmail accounts without any worry, you can create a new one for each email if you wish.
Aaron Wall has written an interesting flight of fancy (I mean it as a compliment) in Google+ and probably had a lot of fun speculating how Google could determine a persona to be a real person. You can almost hear the chuckles in Amit Singhal’s and Matt Cutts’ teams at the Googleplex.
Aaron speculates that the following behaviours help to brand a persona as a real person:
For the details you will need to see his Google+ post.
Aaron has made a great start but IMHO other behaviours can be deduced. I spend most of my time with large corporate sites and reading the above with that lens made me shake my head. There is often no corporate Google account other than to create a WMT account at best. There wouldn’t be a credit card tied to that account. It wouldn’t use Maps to get directions. It wouldn’t watch YT. etc
Such a filter is fine for removing scraper sites from further evaluation, but I have a problem with his statement: “Of course no user will score super high on everything, but they can get probabilities & toss out usage data on anything below an 80% level of confidence.”
If this were so, then most corporate personas would fail, leaving their sites in peril.
I strongly believe that sites that pass a TrustRank (PDF) test with a high score are immune from checks that the rest of our sites have to endure.
Let’s leave spammers out of this article. At my Australia/New Zealand Directory I see many SEO companies submitting links on behalf of clients with a fresh Gmail address that is probably not used after an initial round of link submissions. They might use that address to submit some articles to directories and the really inept agencies might use it for comment and forum signature spam. That’s it.
What’s wrong with this picture? Anyone in our industry can spot one of these Gmail addresses as a fake often by looking at them. I delete whole chunks of waiting submissions merely by looking at the address and not the actual submission. They are always a text string that ends with some digits.
I am not a retail SEO, so I don’t need to do this, but in the interests of improving the industry, here is how I would go about setting up a persona (leaving out details that might help the wrong people):
I won’t elaborate on how to make these personas more convincing, other than to say that they should have been created a long time ago, gradually, perhaps from different cities when you were visiting them. Creating 20 Gmail accounts from the same IP address in one session is a bad idea.
I don’t use many Google services, such as Checkout, Picasa, Gmail.com address, etc, so I might score low in Aaron’s list of checkpoints. However, I use addresses that were created in 1994 and 2002 and have left a vast trail all over the web since then. Spoofed spam has been sent from one of those, but I have not noticed any lasting damage to rankings, if any. I do participate in Google+, Groups, Orkut, WMT, Maps, and some other Google services, so my various accounts should look very human.