Josh (the blog)

I’ve delivered simple, clear and easy-to-use services for 20 years, for startups, scaleups and government. I write about the nerdy bits here.


@joahua

Ansearch answers

All had been quiet on the Ansearch front as I awaited a response from Ansearch CEO Dean Jones, promised a hair under two weeks ago when I alluded to an earlier analysis/criticism I’d written when talking about the state of play with Australian search engines, specifically referring to the then-newcomer Ansearch.

Dean picked up my post via Technorati, a blog search engine that uses RPC update services to track what people are talking about in real-time. I was suitably impressed by this diligence and apparent desire to hear what the market has to say about their product: could this be the same company whose birth was so marred by a spat of cyber-squatting, in what Dean Jones was reported to have described as a fit of “youthful exuberance”?

Apparently so. Ansearch’s beginnings, though marred by dubious practices1, received praise from various quarters of the mainstream press — or, at least, those quarters not controlled by News Corp, whose domains had come under threat. However, the Internet community responded quietly, and those voices that were heard were mostly of disdain at Ansearch’s domain practices.

Strangely enough, my original post wasn’t about any of that. I hadn’t heard of Ansearch until I read an article on them in the SMH — an article which reads a little too much like a rehashed press release for my liking: the telltale sign is in the closing sentence “Ansearch is the search engine division of Optum Ltd.” — if it were filed in the Business section of their paper, I’d understand, but it wasn’t.

I wandered over to their site, played around for a bit, and decided their offering was mediocre. In hindsight, it probably didn’t help that I wasn’t shopping for anything in particular — according to a ZDNet article, “In the short term [Ansearch] is focusing very heavily on the commercial end of the market.” — but at that point in time, I also don’t think they’d tuned their listings particularly well, as a search for DashLite turned up my WordPress hack over commercial listings for the actual Dashlite brand I inadvertantly used.

I say “at that point in time”, because it appears to have substantially improved since, as per Jones’ claim: “Much has changed since your first article on us some 6 months ago.”

Much improved, it seems, on several fronts. Their core offering has shaped up nicely, and some facets of my initial complaints regarding accessibility have been met. Their ancillary product offerings seem to have developed nicely: Ansearch CEO Jones claims “Each of [our properties] goes through up to 7 stages ranging from an initial, simple SERP/Directory style page through to a more involved service, mini portal, search tool, etcetera.” He went on to say that these ancillary properties (such as http://www.picsearch.com.au/, http://www.videosearch.com.au/, http://www.thefreedictionary.com.au/ and http://www.messengers.com.au/ amongst several others) are currently being actively separated from the core Ansearch site (he described it as “quarantining”), and the exact direction of a number of these projects would become clear over the coming months, with the appointment of a full time manager of these online properties.

I’m a tad concerned about his description of their strategy with regard to these — he said this would become clear over the months to come, and I’m hanging off two words here: distributed portal. Whilst I can see this as being of value to users (especially for generic, non-brand-specific/legally dubious domains such as jokes.com.au and the ones listed above), it doesn’t seem to fit Ansearch’s core strength as I perceive it: as a commercial portal, and not as another Google. “We are not aiming to be another Google… we don’t have their budget and, to be frank, there are enough people trying to clone them: why build another?”

In fact, Jones suggested that Ansearch’s strengths lie in that it is not the ubiquitous search behemoth, and that its index is “something unique… something faster… [and] against the so called “arms race” of search (my SE has more links than yours etc…)”. I’d agree this is indeed a strength, and also a reason for them not to try and be a portal. Australia already has Yahoo! and NineMSN for domestic portals, and I’m struggling to see what Ansearch will do to differentiate themselves in this: but I’m happy to be surprised!

Ansearch apparently holds an index of only 500,000 websites considered by its metrics to be “most popular”. I argued that this was potentially a bad thing as relevant content might lie outside this realm: for example, this website performs well when people search for reviews of the HP 2610 or information about Apache on Ubuntu linux or ACT files from MP3 players that record audio, but isn’t included in Ansearch’s core index.

Which is perfectly valid, for a commercially-focussed site, I just think they could be missing out a little bit. They can leverage on my content for their advertising impressions and potential clickthroughs, because they have more valuable content showing up in their listing alongside advertised products. If someone reads my HP 2610 review after having found it in Ansearch, and decides they’d like to buy it and remembers having seen a “Buy HP printers!” ad on Ansearch, they’ll most likely click “back”. It’s abstract, behavioural stuff, but valuable nonetheless.

Whether it’s valuable enough for them to bother is another matter. “We spider our own content… something that over time will be done daily,” says Jones. “Having only 500,000 websites will allow us to index sites more often, and as is the case with the ‘site info’ pages, provide far more info on these pages.” Which is a value-add, and worth preserving. If that’s all resources permit, I think they’re doing the right thing as is. Jones openly admits Ansearch’s index of popularity “has a commercial flavour to it” — and rightly so. Given their much-touted gender and age demographic based search feature, this makes sense.

Their index of popularity seems to be fairly slow-moving. “Monthly we add around 20,000 sites… and take out 20,000.” I’d guess this would be the lowest 20,000 that gets shuffled, and this seems to make sense. One has to wonder whether all the higher-ranking pages can have substantially fresh content month after month, but presumably they do — it’s one of the things the SEO experts have always cried from rooftops.

It was interesting to hear Jones speaking about these people, too: amusing, even! Web developers the world over often join in speculation as to what exactly makes search engines tick, such that we can boost our clients (or employers) website’s performance. It seems the reverse is also true: search engines all over the world similarly speculate as to what those horrible developers are doing to screw with their indexes day in and day out!

I don’t say this in jest, and I believe they’re right to complain: “The larger SE’s are having a very tough time coming up with clever ways to index content to counter SEO… only to have SEO’rs quickly find ways around it. Cat and mouse…” I think “counter SEO” was a poor choice of words, given that relevant content should hopefully still be rewarded, but his point stands.

Just as interesting is Ansearch’s strategy to avoid falling prey to dodgy SEO tactics:

By only indexing the root page, we remove almost all SEO trickery. This works in 2 ways. Firstly, people rarely put spam on their home page — that is, doorway pages, link farms, etc. usually reside away from the main index… and, secondly, it deletes multiple results from the same website. It also stops the site owner/webmaster from saying they are relevant to 100 or 1000 keywords or phrases.

Kids, we just found a new argument against clients who love their splash pages!

Content rich front pages aren’t, however, an absolute solution (at least, not in Ansearch’s index). According to Jones, Ansearch’s policy of “ranking sites in true usage popularity, both on and offsite” is “SEO proof… or at the very least, extremely resistant.” I’d agree it’s a powerful metric, but my reservations above still stand.

One caveat of Ansearch’s algorithm that appears potentially exploitable is its failure to exclude content in the