People versus search engines

It seems that search engines are an immutable fact of early-twenty-first cen­tury exis­tence. We can’t escape them in any imme­di­ate sense, and can­not believe they could ever dis­ap­pear (I recall one instance on Whirlpool forums where a user thought his/her ISP’s inter­a­tional link must be down because he couldn’t access Google. This was one of the very few times Google had actu­ally dropped off the face of the planet for about twenty min­utes. It was sim­ply out­side the realm of possibility.)

Yet, increas­ingly, our surf­ing habits are defined by this bizarre social con­cept that seems to be shap­ing cer­tainly acqui­si­tions and web-two-point-oh-bubblism, wherein web­sites serve users by con­nect­ing them with one another, not on the basis of them know­ing what they wanted, but rather in a bizarre a pri­ori man­ner whereby degrees-of-separation (MySpace) or user-supplied-already-knowns (Live­Jour­nal, Xanga, etc.) define con­nect­ed­ness and dis­played content.

Search is no longer the macro-inter killer app, but an intra-site facil­ity applied to micro­cosm — often based on “trans­par­ent” tech­nol­ogy that has, on the basis of known knowns (in the words of a cer­tain Rums­feld), already done some of the hard work for users (I should say peo­ple, but don’t out of habit: it is an indus­try haz­ard) with­out actu­ally ask­ing them any­thing. This is where loca­tion– and organisation-based match­ing (cf. MySpace, Face­book, etc.) come in.

But none of this data is intel­li­gently search­able by generic engines.

None of this data (in the case of Myspace espe­cially, hor­ri­bly marked-up doing-everything-wrong-with-the-web tech­ni­cally entity that it is) is avail­able for index­ing by search engines because it’s not abid­ing by any defined seman­tics. There is not, for exam­ple, any over­whelm­ing use of micro­for­mats — hCard, etc. — for defin­ing con­tact details in any com­mon sense. Yet these things are search­able within a given website.

And, what’s more, these things are search­able with great pre­ci­sion within (social net­work­ing) sites. This is because of a very well defined inter­nal seman­tic (not the “seman­tic web”, but inter­nal data struc­tures) and an enforced obe­di­ence to these struc­tures that was never a part of pre-SocNet sites.

Soc­Net plat­forms are rad­i­cally dif­fer­ent from web 1.0 sys­tems in that they are (iron­i­cally) vastly more con­strict­ing. As “web 1.0″ I would cite Geoc­i­ties and free web host­ing ser­vices, por­tals, and all-things-to-all-people con­tent net­works. Now, we’ve got blogs (pre­cisely defined web­sites), MySpace (chiefly Soc­Net pro­files with bits on the fringes com­mon to the users, and now with enough impe­tus to appear unstop­pable), Flickr (free — and fee-for-service that peo­ple actu­ally pay for — web host­ing, pre­cisely defined as photo host­ing), and, strangely, a por­tal (Yahoo!) still on top of Alexa 500 rank­ings. A por­tal that owns both Flickr and Geoc­i­ties, but has changed the model of the lat­ter to place greater empha­sis on fee-for-service host­ing. But I digress into strat­egy — the point is not that, but rather in the way social data is stored.

Flickr is meta-data rich. It uses a well defined sys­tem based on EXIF, intrin­sic seman­tics (title, descrip­tion, tags — tags that get used prop­erly, unlike Face­book which doesn’t bother to make such things clear — I want Face­book to flop, by the way, because it annoys me, so don’t expect nice things to be said about it. It’s a poor closed-system imi­ta­tor, albeit with a stu­pidly effec­tive adver­tis­ing model every­one else should be wish­ing they came up with first but haven’t seen in order to copy… because it’s a closed sys­tem (or used to be) exclu­sive in scope. Which makes it very effec­tive SocNet/Web 2.0, by my own def­i­n­i­tion, so I don’t really have a basis for com­plaint.) and extrin­sic seman­tics (groups, pools, etc.).

Pro­files, unlike ‘pure’ Soc­Net (Myspace, Face­book), per­mit anonymity, but allow dis­clo­sure of as much as is desired: at any rate, that is not the pur­pose of the site. Myspace/Facebook’s rai­son d’etre is pro­files. (Well, and that and cash-cow-marketing-tool of the *R**IA’s of the world) Accord­ingly, its pro­files have very def­i­nite seman­tics even whilst the rest of the site may not (I speak of Myspace more, here). Myspace gives core “Details” pro­file info indi­vid­ual fields, whilst allow­ing a diverse “Inter­ests & Per­son­al­ity” infor­ma­tion in freeform textar­eas that are designed to entice users into par­tic­i­pa­tion (and, pos­si­bly, aid­ing more fuzzy searches — but mostly I think it’s just com­pelling con­tent, as there is no imme­di­ately obvi­ous way to search that data).

“Inter­ests & Per­son­al­ity”, along with blog con­tent, seems to be the only freeform con­tributed mate­r­ial avail­able on the site. Want music or a video with your pro­file? You’ve got to browse to the band’s site, load the player (no go in Opera with Flash at the minute, it seems), and then select “Add” on the track. They (yeah, it’s kinda big-brotherish) know exactly what song you chose, what band it’s from, what genre, etc. — that is to say, unam­bigu­ously and cer­tainly beyond a probably-common song title. This isn’t an upload-yourself-and-we’ll-manage-rights kind of thing. The offi­cial­ity gives that inter­nal data struc­ture that much more depth: but, again, the point is that the data is inter­nal and not open.

This, it seems, is the defin­ing qual­ity of Soc­Net. That’s what makes the ideas of open fed­er­a­tion advo­cated by Google Talk ear­lier this year so bizarre for the rest of us. We don’t par­tic­u­larly care, because closed sys­tems mean inno­va­tion (because we can define new data for our­selves to work with) and/or exten­si­bil­ity that isn’t pos­si­ble in an open plat­form (if, for exam­ple, not all fed­er­ated part­ners agree to a spec exten­sion — take, for exam­ple, Google Talk’s own Jab­ber base and pro­pri­etary VoIP on top of that). Open­ness is in Google’s inter­ests, because it’s so depen­dent on things being open for its core busi­ness (search). But real peo­ple want ser­vices that work, not ser­vices that push them to another site. I’ve never trusted sites that bounce me off to Google for their site’s search, even if it’s one of those crappy co-branded things. It doesn’t make sense. Why would you make some­one inspect your web­site from an infe­rior per­spec­tive when all the infor­ma­tion is stored in a data­base, with the pos­si­bil­ity of more seman­ti­cally mean­ing­ful search open inter­nally only?

Google won’t deal with your inter­nal search needs. It’s not designed to. It does a great job of deal­ing with pub­licly indexed mate­ri­als com­pletely aside from Soc­Net ser­vices. Soc­Net sites thrive on and are empow­ered by strong intrin­sic seman­tics that make clever profile-based (or UGC–based) search pos­si­ble, which builds loy­alty etcetera in a way for­eign to infor­ma­tional web­sites. Soc­Net is expe­ri­en­tial and (sur­prise sur­prise) social — it doesn’t have to be about anything.

Con­tent was deposed as king some­time in the mid­dle of the first decade of the twenty first cen­tury, and with that regime change his deputy, Search, was also shuf­fled to a some­what less promi­nent posi­tion. Some­where out of sight, Search’s iden­ti­cal twin, Query, is the real power behind the throne: it uses unin­dexed data and makes clever links to bring peo­ple closer together in a way that tra­di­tional search engines had never even envisaged.

Follow-up: Wireless AP draws ASIO

Sev­eral days ago now I posted regard­ing a story first pub­lished on Whirlpool, then mak­ing appear­ances on web­sites such as The Reg­is­ter (and prob­a­bly oth­ers), which details a heinous act of ter­ror­ism tak­ing place: and not on the part of the man who put up the Access Point, but rather by the enforce­ment agen­cies, who (so it is sug­gested) were intim­i­dat­ing in their deal­ing with the “offender”. Read the rest of this entry »

# by Josh on November 4th, 2004 Tags: , , ,
| No Comments »

Wireless AP draws ASIO

And today, it’s Aus­tralia being ridicu­lous: Whirlpool Forums — Thread: Wire­less AP draws ASIO. If this ends up going to air, I’ll make sure I watch TV for once.

Watch out, this guy is try­ing to kill his whole neigh­bour­hood with 2.4GHz microwave tech­nol­ogy! I setup a friend’s wire­less broad­band modem/router/access point over the phone last night: I’m expect­ing the cops to knock on my door any moment for col­lu­sion with terrorists.

Wireless broadband on the cheap

Whirlpool have just announced (last night) the release of new broad­band plans from Veri­tel, with insane dis­counts and ben­e­fits for Whirlpool vis­i­tors. Read the rest of this entry »

Using quota…

The fol­low­ing was posted on Whirlpool sev­eral min­utes ago:

Well, I’m on 512 Lite, and have finally resigned myself to the fact that there is no hope (in the near future) of get­ting a speed upgrade at the same pri­ce­point, even if it meant get­ting less quota than I have now (note, it is still pre-July 12).

As shock­ing as it may seem, I strug­gle to use even four GB (not four GB on/off peak, four GB alto­gether) a month. I’m now faced with the won­der­ous prospect of being able to down­load SO MUCH MORE! Wow! 24GB of quota to burn!!!

The only thing is, well… I’m at a bit of a loss as to how to use it. So, with this in mind, I turn to the Whirlpool iiNet com­mu­nity for help! My ambi­tion for the next month is to GET CAPPED! Yes, that’s right. I want to be capped. Not because I’ll enjoy the sen­sa­tion of slightly > dialup speeds, but sim­ply because I’m a stub­born prick who desires to prove a point.

In order to achieve this, I need lots of NON-BLIINKZONE con­tent to down­load! I don’t do pirated stuff, or pr0n, but all other con­tent will be DOWNLOADED!

I invite other iiNet users who are sim­i­larly con­fused as to how to use this amaz­ing new prod­uct improve­ment to join me in bring­ing the iiNet­work to a grind­ing halt… or, at least putting a dent in iiNet’s band­width bill. Sure, it is prob­a­bly insignif­i­cant to an organ­i­sa­tion of their size, espe­cially see­ing iiNet’s cus­tomer base is far broader than the cou­ple of hun­dred who read/post on Whirlpool, but it brings a feel­ing of slight satisfaction.

One other thing — if you’re going to list links, make them links to places that won’t be hurt by peo­ple leech­ing giga­bytes off them… for exam­ple, LinuxISO.org (yes, I know it’s sup­ported by adver­tis­ers… so far as I can see, that’s just ban­ner ads — I’d rather not add to their band­width bill unless it is necce­sary) and other web­sites like it.

Cheers,

J.

If you agree with me, please show your sup­port here: http://forums.whirlpool.net.au/forum-replies.cfm?t=213957

Here’s hop­ing a mod doesn’t take it down — we could get an inter­est­ing list of con­tent happening.

Later:
Yay! Recog­ni­tion! My thread was deleted by MM him­self! Hmm. Per­haps it did have poten­tial to cause prob­lems! Wahoo!

I’m feel­ing elated. The few posts I did see before it was taken down were largely sup­port­ive, with the excep­tion of a few by Python­is­man… he described it as “silly” — which I fully acknowl­edged — it is per­haps a lit­tle ironic that MM listed “Silly thread” as the rea­son for deletion.

The fol­low­ing was sent in a Whim to MM in response to the locking:

Deleted and not locked? Ooooh. Poten­tial to cause dam­age, per­haps…
Tee­hee­hee. Of course it was silly. I just think your changes are silly also… yup, I don’t have the num­bers, but I have inter­net resources which have been paid for to use as I please, which, unfor­tu­nately for you guys, extends to include point­lessly down­load­ing giga­bytes of any­thing just because I can.

J.

# by Josh on July 4th, 2004 Tags: , , ,
| No Comments »

Happy Birthday Kylie!

And lots of stuff like that!  Woo.  Hope you had a great day :)

I spent last night @ the Cat’s res­i­dence (a.k.a. the Cat-man) watch­ing a movie called “The Whole Nine Yards”.  Hehe, I watched TV!  Wow!

Also been spend­ing time burn­ing up the band­width as much as pos­si­ble stream­ing stuff, espe­cially off The­Base­ment.  Any par­tic­u­lar rea­son?  Nope.  I’ve decided that, just like the rest of iiNet appar­ently sees a need to, I must down­load as much as pos­si­ble just bea­cuse I’m allo­cated ridicu­lous amounts of quota by my… gen­er­ous ISP.  Who, just for the record, cur­rently have a dis­tinct lack of func­tion­ing inter­na­tional links, whilst West­Net, Pacific, Tel­stra (who is being blamed for the out­age) and mis­cel­la­neous other ISP’s are func­tion­ing per­fectly (how do I know this?  Hmm.  I’m lis­ten­ing to stream­ing music live from the US via a Pacific relay, surf­ing the inter­net using a Tel­stra proxy, resolv­ing hosts using a TPG DNS server (iiNets went out, again), and Whirlpool reports West­Net as work­ing fine.)

To be fair, iiNet have been decent for the last month or three, but still… UNCOOL.

Statistics and a gimmick

The two are com­pletely unre­lated, don’t worry ;)

The gim­mick?  It is a fish­tank which is USB pow­ered, from ELX.  Yeah.  And you thought the mug warmer was bad.

And the sta­tis­tics?  Well, it’s the end of the first month this web­site has been fully func­tional — so semi-valid num­bers for once ;)

Unique vis­i­tors: 203
Num­ber of vis­its: 366 (1.8 visits/visitor)
Pages: 7183 (19.62 pages/visit)
Hits: 11009 (30.07 hits/visit)

Some­what unsupris­ingly, the day with the high­est num­ber of hits co-incided with the day after Whirlpool heard about iiCom­mu­nity — traf­fic wasn’t sus­tained, though — it dropped down to nor­mal lev­els the day after that.

Hits, by Oper­at­ing Sys­tem that they came from:
Win­dows: 55.2 %
Linux: 42.7 %
Mac­in­tosh: 1.3 %

And by web browser:
Mozilla: 59.8 %
Some Flavour of Inter­net Explorer: 33.4 %
Opera: 2.9 %
and Netscape, com­ing in at a lowly 1.4%.

What other inter­est­ing stats are there?  Umm… Okay.  The day of the high­est band­width usage co-incided with the release of the first draft of my Imag­i­na­tive Com­po­si­tion — hehe, but it wasn’t that huge — chances are it was the pub­lic release of multiple-styles on that same day which was respon­si­ble for drum­ming up the traffic ;)

Ah well.  Next chal­lenge: get some real con­tent, get PageR­ank list­ings, sus­tain hits for the next month!  Hehe.  I have two weeks of hol­i­days left in which to do var­i­ous stuff, so per­haps I’ll squeeze in some cod­ing time for this website.

It’s not hold­ing the same inter­est any­more, though.  I mean… I enjoy it, in the problem-solving sense.  But only when a prob­lem is defined and there are clear expec­ta­tions and out­comes, not when I’m just ran­domly adding fea­tures to my website.

So, to define a need — some kind of file-storage thing!  Because peo­ple ask to see orig­i­nals of var­i­ous doc­u­ments (e.g. that short story “Blurred Real­ity”), and it’s eas­ier to pre­serve for­mat­ting and other such things in PDF’s than it is just copy­ing text into a web­form!  Well, that is one thing I want to work on.

What do other peo­ple think?  What else does this site need?  The doc­u­ment organ­i­sa­tion thing would ess­es­ntially ful­fil the wait­ing “Projects” sec­tions pur­pose, so don’t say that.

I think some­thing to show the most recent com­ments (ala Plat­form 7 front page) would be use­ful, although my design is too nar­row to do that well… per­haps another redesign?

Use the com­ments thing, tell me what sucks most about this site, or is miss­ing, or whatever!