Skip to main content

Corpus Linguistics, Concordance and Data-Driven Learning: An innovative Language Teaching Approach!

The Corpus linguistics refers to a body of text. This text can be written or spoken or a combination of both. Corpora ( plural of corpus) can be based on brief text on a narrow topic or can run into millions of words such as BNC ( British National Corpus, a 100- million words of British English) or Cobuild Corpus.
To access, or make use of a corpus one should use a concordancer to look at linguistic patterns. A concordancer is a software that show instances of words in a body of text. In addition, it allows to show collocations and frequencies of words. This approach can be called Key word in context ( KWIK). Now web-based concordancers are being increasingly available, such as Cobuild and lextutor.
The following screenshots are from the Cobuild web-based Corpus
1. Write the word you want to query in the box, then click “show conc”
2. A pop-up window will appear with the instances of the word
Click here to view it ( better than providing a screenshot :)
3. To use in the classroom, teacher should turn students’ attention to the collocations and usage of the word in authentic language ( since the word is retained from authentic text and not text made specifically for esl/efl). Students then can derive grammatical rules (eg. modal verbs, indefinite pronouns..) and notice how certain vocabulary is used in authentic context. They can deduce what a certain vocabulary collocates with.
ubiquitous concordane
4. The teacher can also use a fill-in-the-blank activity where the word in query is omitted. If internet connection is not available or there are no computers in the classroom, the teacher can distribute them as printout. Note that the teacher should spend time preparing this concordance before presenting it in the classroom to ensure that it targets the intended language use.

Using corpora in the classroom involves making use of concordance software to analyze a corpora ( or web-based concordance such as the above example) and spot patterns and differences in language usage. For instance, students can use corpus linguistics with the aid of a concordance to make error corrections to their writing, or the teacher can show students a certain syntactical or lexical usage for students to induce the rule ( inductive learning), called data-driven learning since it is based on a data analysis that results in linguistic learning. ( check out the father of data-driven learning website, Tim Jones). For more on the idea of data-driven learning click here. Of course, using a concordance and corpora is not easy for students to handle so it is imperative that students practice extensively on deriving or inducing rules from linguistic patterns, or even correct their linguistic and writing error based on a written corpus.
Again, it is important to note that data-drive learning demands extensive practice before employing it as an approach. The role of the teacher becomes that of a manager, orienteer, and observer and the role of the student changes to a researcher of language.
Why use this approach instead of traditional grammar and lexical instruction?
  • It exposes the language learner to authentic language instead of rather fabricated ESL text
  • It changes the role of the language learner from a mere receptive individual into a language researcher ( note that this approach might not work as expected especially with young learners).
  • It ensures a learner-centered classroom without diminishing the role of the teacher
  • It encourages learner autonomy with regard to errors correction ( will be discussed in my next post)
More posts will also discuss more on concordancing, data-driven learning, and corpora. How a teacher can collect a certain corpus for a certain learning context, how a teacher can analyze his/her learners’ linguistic output, such as writing, called learner error analysis,and how to use corpora in more activities in the classroom.
Now, I leave you with some links to concordance software, including web-based, that you can use and play around:
  • antconc Lawrence Anthony’s free concordance software that you can download
  • monoconc pro commercial concordance
  • concordance commercial concordance software that I use
  • lextutor a free web-based concordance
  • Cobuild free web-based concordance and corpus
Next post will discuss how to integrate a concordance in word processor to result in error noticing and learner autonomy.
If you have an queries , need more info, or just want to post feedback please post a comment. Your comments are highly welcome :)

Popular posts from this blog

Edmodo: A Microblogging Educational Platform

I’ve been aware of edmodo for quite a time now though I have never had the chance to use it with my students yet, as the scholastic year did not start yet.
What is Edmodo?
We all know twitter as a social networking platform and a microblogging platform for language learners right!!! The thing is that twitter does not have the security that our students need for safe microblogging. This is where edmodo comes in with its enhanced new features.

Simply put, Edmodo is a microblogging platform for education. You notice this on the home page of edmodo where there you can sign up as a teacher or a student.

Once you enter as a teacher you have to create an account to use edmodo. Your pesonal page contains all the features you need to connect with your students. You can upload assignments with files, link to urls, embed videos, or post a note.
The security in edmodo is that you have to create a group to connect to. Once this is done, you are given a code which in turn you give to your students.…

Emerging Technologies, Key Trends,and Challenges in K- 12 Education

The NMC Horizon 2013 report  came out couple of weeks back with its time-to-adaption of emerging technologies in k-12 education. What New Media Consortium Horizon does is conduct extensive research in the domain of digital learning, and project their probability on the adoption of emerging learning technologies. The report features six technologies with three adoption horizons: 1 year, 2 to 3 years, and 4 to 5 years.The report also includes major trends in the area of digital learning in k-12 education and the major challenges facing education in terms of using technology in education.
Time-to-Adoption for K-12
New-term Horizon (Time-to-adoption 1 year)
Mobile Learning Mobile learning is becoming an essential part in k-12 education. There have already been many initiative programs like the one-to-one and the BYOD programs to help students learn anytime and everywhere. Mobile learning also has more affordance than laptops or PCs for combining the real world and virtual tools in what’s…

No More Text Comments: Giving Voice Feedback on Google Docs

One of the best features of Google Docs is that teachers can comment on students’ essays by highlighting the selected text and giving textual commentary. Students can in turn comment back on their teacher’s comment, making it a great formative feedback for essay writing. It does not only kill the red ink annotations, which are a real annoyance to students, but also targets the intended selection to comment on in an organized manner. However, if teachers want to take formative feedback to new levels of personalized feedback, a voice feedback would be a great solution. This is what Kaizena actually does, and more. Kaizena is a voice commentary online application that integrates fully with Google Drive to maintain the smooth workflow. You don’t even have to go to Kaizena website to install it. It works much like the Google Docs commentary but instead of textual commentary in the highlighted essay section, you include a voice commentary. Kaizena also supports text commentary and highlight…