On concord.
Moderators: Dimitris, maneki neko2, Lorikeet, Enrico Palazzo, superpeach, cecil2, Mr. Kalgukshi2
-
- Posts: 1421
- Joined: Sun May 18, 2003 5:25 pm
Here's an extract from an e-mail I got from Costa Gabrielatos, linguist:Stephen Jones wrote:Fabulous! Now I don't have to suffer the "you can find anything on Googe" arguments in favour of arbitrary prescriptivism.
1. Some of the free online concordancers suggested by posters use
the web as a corpus. Granted, the web is much larger than the
largest corpus. To get an idea, consider that Google searches 8
billion web pages, which translates into trillions of words,
whereas the largest corpus (the Bank of English) has just above
half a billion words. However, the web is not representative of
any given language variety, medium or genre - in fact, it
contains both NS and NNS English.
Have fun at the link.
-
- Posts: 3031
- Joined: Tue Oct 26, 2004 6:57 pm
- Location: UK > China > Japan > UK again
-
- Posts: 1421
- Joined: Sun May 18, 2003 5:25 pm
-
- Posts: 1421
- Joined: Sun May 18, 2003 5:25 pm
The proportion of web pages written by North Americans dwarfs that of the other native speaers, and, as long as you exclude the autotranslations, comepletey overshadows the number produced by Non-Native speakers.
Now, where you are likely to get into problems are when you simply use the statistics of the concordancer, wothout direct reference to the links.
Now, where you are likely to get into problems are when you simply use the statistics of the concordancer, wothout direct reference to the links.