Aelon - Gaming & Technology Blog. 9rules Network
  • Blog Founded: July 20, 2004
  • Total Entries on Blog: 239
  • Most Commented Entry: Jack Thompson... Straw Man
  • Total Comments on Blog: 2095

Aelon is a collective blog based on video games, technology, and general geekery. It is also a member of the 9rules Network, a large group of independent blogs dedicated to quality. Check it out.

‘Hello? …Is This Thing On? …’

By Kelmon

Image of old-style microphoneI’m pretty much always in awe of many of the blogs where authors manage to come up with new content on a daily basis. Personally, I’m not a big writer of articles (a couple recently, which is pretty rare) and have pretty much exhausted the topics that I could think of. So, when it came to wanting to add some new content to the site and keep it from getting a little stale, I was a little stuck. Still, what’s the best idea in these situations? From the blogs that I read it seems to be to write about your experiences, so here goes…

Last night I started talking to my computer…

For a while now I’ve been phoning home to my parents and friends in the UK via Skype using the Mac OS X client, a pair of walkman headphones and my laptop’s integrated microphone. Anyway, last night I was trying to see if my microphone would work better if I calibrated it and went into the System Preferences looking for anything that might help improve Skype conversations. What I came across, however, was a feature of the OS that I’d not come across before and, while it entertained me for 1/2 an hour, drove my wife nuts.

OS X Speech Recognition Settings PanelWhat I came across was the Speech Recognition software built into OS X. It’s not full speech recognition of the likes of Dragon NaturallySpeaking whereby you can dictate into the likes of Word but the Speech Recognition software will allow you to do stuff like move around in a document, open email, switch applications, get the phone number of a person in your Address Book, etc. To be honest, I found it all to be kinda neat (to the point that I’d use the word “neat” in conversation) but not without flaws.

Screenshot of using Speech Recognition to make a new Sticky Note of selected text in EctoThe principle problem with the software is that it’s not terribly accurate, or it just hates my accent. The little setup screen allows you to calibrate the volume of the microphone and test the software with some test phrases, which all went pretty well. However, when it came time to try and use it quite often it either wouldn’t understand me or, worse, would misunderstand me completely. I must have spent 15-minutes working my way through the “Tell me a joke” command that basically generates a series of Knock, Knock “gags”. The software would understand my “Tell me a joke” request but hardly ever understood me saying “Who’s there” and the other parts of the joke. For those interested, the only joke I did manage to get to the end of was the following:

“Knock, knock”

“Who’s there”

“Thermos”

“Thermos who?”

“Thermos be a better way”

Yeah, no shit, Sherlock…

The software does tell you which commands are understood, which is nice if you’ve lived all your life under a rock and don’t know how a Knock, Knock joke works…

Similar issues were experienced when changing applications. For some odd reason the software found me totally comprehensible when asking it to “Switch to OmniGraffle Professional” but kept interpreting any requests to Open/Change to/Quit iCal as Open/Change to/Quit iChat. I tried faking an American accent to see if that would help things but the only American accent I know is from the likes of Mississippi/Tennessee, so I don’t think that really improved things…

The other principle problem was not with the software itself but because I felt a bit of a tit issuing voice commands to my computer. This is not to say that talking to the computer isn’t an entirely alien concept as I frequently swear at the likes of MS Word when it starts playing silly-buggers. But still, despite the fact that I’d grown-up watching the likes of Star Trek and marveling at how great it was to be able to talk to the ship’s computer rather than using the keyboard on my old Commodore 64, in practice it feels kinda embarrassing. This, perhaps, wasn’t helped by my wife telling me that she’d divorce me if I didn’t stop saying “Who’s there” in assorted styles/accents.

Obviously, it’s easy to see the benefits of such a system for the physically disabled but I’m wondering whether there is ever likely to be a time when I’d prefer to use such an interface instead of the usual keyboard/mouse. I’ve spoken a few times on why, in my opinion, the keyboard/mouse combination is awful for games and have noted that Nintendo’s Revolution controller looks like a step in the right direction, but I’ve not extended this to “normal” human/computer interaction before. While vocal communication might be a while away, if only for the fact that the English language is so phenomenally complicated, perhaps other forms of interaction should be considered. For example, I still think that the “hand as physical pointer” interface that Tom Cruise employs in Minority Report to move “windows” around a screen shows a lot of promise. Yeah, touch-screen styluses go some way to providing this kinda interface but they’re just not as cool…


  1. #1  Vermouth
    27th September | Reply

    I think if such a thing is going to be developed it’s going to come from the far east. We’re just not there yet but the East Asian territories are the one place where it makes sense to spend a bunch of money developing it because the written asian languages just don’t convert as well to a keyboard as European languagaes. Frankly spend a few months working in a call center and your typing speed nearly catches up to people’s talking speed. It just seems more cost effective these days to train people at typing than it does to spend huge sums of money on development of software that very few people in the West would even have a need for.



  2. #2  Cyrris
    27th September | Reply

    I do feel like a bit of an ass talking to my computer screen when I’m on Skype, but conference calls with friends are pretty fun so it makes up for it.

    One thing similar I tried for the first time the other day was on my friends new PDA. I was able to hand write my words in MSN, and the PDA wasn’t always good at interpreting it. That said, I was surprised at a few of the times where it got rather complicated words right. Maybe it depends on how neat I write.



  3. #3  Thornhillboy
    28th September | Reply

    I realised recently than when I am on the computer, I am deadly silent. I do not make a sound, apart from the occasional chuckle and the times I break into song to stop myself going insane.

    But I like the silence. As my computer is in a private place, talking to it, or even to friends, would just seem silly to me. Heck, I even hate phones!



  4. #4  Kelmon
    28th September | Reply

    Hmm, video conferences wouldn’t make me feel too silly talking to my computer since at least a real person would be talking back. Still, that’s not really an interface issue so I’ll move on from that one.

    Actually, I seem to recall having seem, fairly recently (and less busy people can hunt down a link from Wired or somewhere) that human/computer interfaces where the human can control the computer with their thoughts are working in the lab. WITH THEIR THOUGHTS! It’s pretty much for the disabled, again, and so far seems to be moving a mouse cursor but that’s still really cool. What kinda worries me is that with the average human male supposedly thinking about sex once every six seconds, you’d spend all your time with bloody porn sites appearing on-screen. What you get up to at home is your problem but when they start appearing in the office while your boss walking around, then that’s a problem…



  5. #5  Kyle
    28th September | Reply

    I actually just recently to discovered this. It’s not a feature that is advertised a lot.

    For some reason though, speech recognition doesn’t work with my voice ( lol ) so it sucked for me.



  6. #6  Thornhillboy
    28th September | Reply

    IRT the thought comment

    Last year I attended a science lecture about that sort of thing. The speaker was Kevin Warwick, and he had been pioneering a way of interfacing with computers through his body.

    He had a computer chip plugged into him, and he was then able to, by clenching and unclenching his hand, control what went on on a computer. He could cycle through the options, and select one. He progressed this by being able to turn music on and off, turn a lamp on and off, and even control a prosthetic hand in New York (he was in Leicester at the time) all through his hand.

    He then showed how he connected himself up to his wife, and she was able to feel him clenching his fist, in her hand, making her want to clench her fist.

    Very interesting.



Archived entry. Read only