Collocation

Oliver Brown
— This upcoming video may not be available to view yet.

Collocation refers to a phrase or small group of words used together in normal speech with restrictions not explicitly imposed by grammar.

Quite an odd concept, but vitally important to language learning. Correct use of collocation is probably the best way to identify a native speaker from a near native speaker. I bring this up now because Julia made a wonderful example of an incorrect collocation for English.

She came up with the phrase “two and a half hundred”. In English you can say “two and half thousand” and “two and half million” but for some reason it doesn’t work with hundreds (it does in Finnish incidentally).

Language learning pricing and making it pay for itself

Oliver Brown
— This upcoming video may not be available to view yet.

If I’m considering paying for voice talent for the language app (which let’s face it, I’m going to have to do) I have to be thinking about getting the money back somehow and I have an interesting idea that essentially equates to everyone helping each other learn languages.

Basically when learning, you pay per phrase (phrase in this context means any named element. Usually a phrase but could also be a specific term). The cost would be something really low (say for example $0.01 each). You only pay for a phrase once regardless of how many times it gets repeated or how many conversations it appears in. Just to provide some sort of concrete example, the material I’m testing with (which covers the first two Pimsleur lessons) has 82* different phrases/terms. Quite a few simple conversations can be put together with that material. As the number of phrases increases, the number of conversations increases exponentially (the mathematician within me has to point out that strictly speaking it’s probably not exponential).

The clever part would be to allow people to upload their own audio. Although this has a few issues with regards to quality it might work. If you upload audio, you get a percentage of the money spent on listening to your audio. What the exact percentage should be is complicated though. As well as audio, the other big part of the system is the scripts. The traditional part of me feels that the scripts should have some sort of professional input from someone with experience teaching the language. Another part of me realises that hundreds of books exist for teaching languages that are written by professionals that are totally useless (and therefore professional input may not be all its cracked up to be). With that in mind, someone fluent in the language may be all that is required. Either way, the script writers need money too and should probably get a percentage as well. That $0.01 is being spread quite thin…

The system obviously needs an infrastructure in place to sort this. At a basic level it should list phrases that are needed in scripts but are missing. Another part would be to highlight underutilised phrases that need more conversations writing for them.

As a final note I’d like to explain another bit of cleverness in the system (and an associated problem, perhaps). Before a conversation is played it is checked for completeness - i.e. do sound files exist for all the required audio. Shortly I’ll be adding another layer to this - checking for sound files by the right people. Phrases in a script are marked by “person”. Simply a way of identifying who is doing the talking when a script involves more than one person (which most of them will do). It’s important that all elements in a script marked “Person1” are by the same person and all the parts by “Person2” are by the same person (and that “Person1” and “Person2” are different from each other). This means that there will be some duplication of audio going on. If someone records all the initial audio, but then never records any more, someone else will have to re-record most of it since it will be used in later scripts. Which also implies there’s nothing to stop people re-recording the initial material in an effort to get a percentage of the money (assuming enough is offered to make it worthwhile). I’m not sure if that really is so bad though - it offers more variety for listeners…

* Or perhaps 164. Whether you should pay for the native and foreign versions is a tough subject. I’d like the application to be independent of a specific language so my preference would be “yes”. They take just as much effort to record after all…

How much fluff is needed?

Oliver Brown
— This upcoming video may not be available to view yet.

I’ve been sorting out exactly what needs recording for the language app (which I finally have an idea for a name for) and I was trying to decide how much extra instructor speech is needed. Situations aren’t described for instance (no “Image an English man sitting next to a French woman”) and you aren’t asked to say things explicitly (“How do you ask someone if they speak English?”). Will this harm the process at all?

The best thing to do perhaps would be to avoid trying to be Pimsleur quite so exactly.

Let the research begin…

Oliver Brown
— This upcoming video may not be available to view yet.

As a follow up to my post about becoming a scientist in EVE Online I can now report I am now officially researching.

Unfortunately I don’t think I’ll be generating any cool tech II blueprints for a while though. I am generating 3.8 research points a day. Although I don’t have anything to compare that with, it doesn’t sound like many to me. So right now I’m busy doing agent missions for Lai Dai - the corporation that my research agent belongs to - in an effort to increase standing and therefore the number of RPs I get.

There are lots of little things I’d like to know (and I’ll test and find out if no one tells me) about research. Like:

  • Do any improvements in skills or standing take an immediate effect or do you have to stop researching and start again?
  • Do the points you’ve accumulated stay or disappear when you stop researching?

Any voice talent out there?

Oliver Brown
— This upcoming video may not be available to view yet.

Foreign language voice talent needed for the still-unnamed language learning application.

I have a series of phrases I need recording with a total audio time of about five minutes. I need them in as many languages as possible (although if it isn’t English, German or Finnish then I’ll also need them translating - they are really simple by the way). My main requirements are that the recording is good quality and that you are a native or fluent speaker of the language.

After looking around the Internet for a bit I discovered I could technically afford the hourly rate of most voice actors marketing themselves on the Internet - except they all had rather high minimums which made my five minutes very expensive (although I realise five minutes of audio takes more than five minutes of work). As well as money I can also offer a link and a review which has to be worth something (after all people are paying me for links that have nothing to do with the content of the site and presumably think it’s worth it).

If you’re interested, email me with the language(s) you could do and a quote (and preferably a sample but I acknowledge that this approach is hardly targeting professionals).

TalkTalk service could change

Oliver Brown
— This upcoming video may not be available to view yet.

To all those people with TalkTalk problems, the quality of your service may change.

I found this page on Sam Knows that is apparently a list of telephone exchanges with scheduling dates for TalkTalk unbundling. According to the information there, no exchanges have actually been switched completely to TalkTalk yet. When they do switch there should be a difference in quality. For most people it apparently couldn’t get worse so this should be seen as a good thing.

The date for the changeover is 31/08/06. I’m a little suspicious about the authenticity of those dates since a reasonable amount of physical activity is required to switch over and doing them all at once seems silly.

Multilingual pretty URLs

Oliver Brown
— This upcoming video may not be available to view yet.

There is more and more emphasis on pretty URLs these days. With things like Ruby on Rails around to easily support it and better knowledge and use of things like mod_rewrite the days of horrible query strings is going away (excluding of course the most used websites - search engines). But how do you make your multilingual website have pretty URLs?

My language learning app uses the Zend Framework and so uses pretty URLs by default. I need the interface available in many languages, but then the URLs should be pretty in a localized way.

For example, starting a new Finnish lesson uses the following:

/lesson/new/fi

That would be the new action of the lesson controller with an extra language code parameter of fi.

In German this should be something like:

/lektion/neu/fi

By default this would access the neu action of the lektion controller.

The “simple” solution would be to write lots of controllers that just delegate to the real one. Which is silly. Instead an extra layer has to be added to the routing process some sort of look-up table mapping localized URL fragments with “real” canonical ones. This should be fairly simple with Zend Framework (although I haven’t actually tried yet).

Just an important issue no-one seems to have brought up yet…

Charles Dunstone’s email address

Oliver Brown
— This upcoming video may not be available to view yet.

Some of the people trying to cancel their TalkTalk subscriptions have apparently had success email Charles Dunsonte, Carphone Warehouse’s CEO, directly. Two email addressed have been suggested. I haven’t personally tried either so I can’t guarantee any sort of result.

Unlimited is getting sillier

Oliver Brown
— This upcoming video may not be available to view yet.

The Metro, (the only newspaper I read since it’s free) had an advert from ntl:Telewest for their broadband service that said:

Cable broadband has unlimited downloads…unlike BT, Sky, Orange or TalkTalk

BT and Sky at least (and possibly Orange, not sure) definitely offer “unlimited” packages that are restricted by a fair use policy. The fine print at the bottom of the ad said that the ntl:Telewest also had a fair use policy. So how exactly are they “unlike” each other?

On a related note, the number of replies with TalkTalk complaints is getting really high. The irony is none of the broadband companies have tried competing on quality of service yet.

Galaxia ♠ Renaissance

Oliver Brown
— This upcoming video may not be available to view yet.

Galaxia as a game may be dead, but it’s definitely not forgotten. Head on over to Galaxia ♠ Renaissance to read a lengthy (and getting lengthier) piece of fiction by former Galaxia player, Ashley Rayburn. It contains most of the well known players, the Consortium, the UGC as well of course yours truly, Q. If you don’t know what Galaxia is, it won’t mean much to you, but go read it anyway. It’s funny.