
Corpus Linguistics, Concordance and Data-Driven Learning: An Innovative Language Teaching Approach!

A corpus is a body of text, and corpus linguistics is the study of language through such collections. The text can be written, spoken, or a combination of both. Corpora (the plural of corpus) can consist of brief texts on a narrow topic or can run into millions of words, such as the BNC (British National Corpus, a 100-million-word collection of British English) or the Cobuild Corpus.
To access or make use of a corpus, one uses a concordancer to look at linguistic patterns. A concordancer is software that shows every instance of a word in a body of text, each with its surrounding context. It can also show a word's collocations and frequency. This display is called key word in context (KWIC). Web-based concordancers, such as Cobuild and Lextutor, are increasingly available.
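To make the idea of a key-word-in-context display concrete, here is a minimal Python sketch. It is not how Cobuild or Lextutor work internally; the function name `kwic`, the `width` parameter, and the sample sentences are all made up for illustration.

```python
import re

def kwic(text, keyword, width=30):
    """Return one key-word-in-context line per whole-word match of keyword.

    `width` (a hypothetical parameter) is the number of characters of
    context kept on each side of the keyword.
    """
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    lines = []
    for match in pattern.finditer(text):
        left = text[max(0, match.start() - width):match.start()]
        right = text[match.end():match.end() + width]
        # Right-align the left context so the keyword lines up in one column.
        lines.append(f"{left:>{width}} [{match.group()}] {right:<{width}}")
    return lines

# Invented sample text standing in for a real corpus.
sample = (
    "Mobile phones are ubiquitous in modern classrooms. "
    "The ubiquitous use of social media worries some teachers. "
    "Coffee shops have become ubiquitous in the city."
)

for line in kwic(sample, "ubiquitous"):
    print(line)
```

Because the keyword always lands in the same column, the words to its left and right are easy to scan, which is what makes patterns visible to learners.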
The following example uses the Cobuild web-based corpus:
1. Write the word you want to query in the box, then click “show conc”.
2. A pop-up window will appear with the instances of the word.
Click here to view it (better than providing a screenshot :)
3. To use the concordance in the classroom, the teacher should turn students’ attention to the collocations and usage of the word in authentic language (the lines are drawn from authentic text, not text written specifically for ESL/EFL). Students can then derive grammatical rules (e.g. for modal verbs or indefinite pronouns), notice how vocabulary is used in authentic contexts, and work out which words a given item collocates with.
Image: concordance for “ubiquitous”
4. The teacher can also prepare a fill-in-the-blank activity in which the queried word is omitted from each concordance line. If there is no internet connection or no computers in the classroom, the teacher can distribute the lines as a printout. Note that the teacher should spend time preparing the concordance before presenting it in class to ensure that it targets the intended language use.
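The fill-in-the-blank activity in step 4 can be prepared with a few lines of Python. This is only a sketch under simple assumptions: the helper name `gap_fill`, the blank string, and the example sentences are invented, and in practice the sentences would come from your own concordance output.

```python
import re

def gap_fill(sentences, keyword, blank="_____"):
    """Blank out whole-word occurrences of keyword.

    Only sentences that actually contain the keyword are kept,
    so the result can be printed directly as a worksheet.
    """
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    worksheet = []
    for sentence in sentences:
        if pattern.search(sentence):
            worksheet.append(pattern.sub(blank, sentence))
    return worksheet

# Invented example sentences targeting the modal verb "must".
sentences = [
    "You must hand in the essay by Friday.",
    "She said we must not use a dictionary.",
    "The library closes at nine.",
]

for number, item in enumerate(gap_fill(sentences, "must"), start=1):
    print(f"{number}. {item}")
```

The whole-word match means that a query for "must" will not blank out part of a longer word such as "mustard". The teacher should still read through the output before printing it.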

Using corpora in the classroom involves using concordance software (or a web-based concordancer such as the one in the example above) to analyze a corpus and spot patterns and differences in language usage. For instance, students can use a corpus and a concordancer to correct errors in their own writing, or the teacher can show students a certain syntactic or lexical usage so that they induce the rule themselves (inductive learning). This is called data-driven learning, since the linguistic learning results from an analysis of data. (Check out the website of Tim Johns, the father of data-driven learning.) For more on the idea of data-driven learning click here. Of course, concordancers and corpora are not easy for students to handle, so it is imperative that students practice extensively at deriving or inducing rules from linguistic patterns, or at correcting their own linguistic and writing errors against a written corpus.
Again, it is important to note that data-driven learning demands extensive practice before it can be employed as an approach. The role of the teacher becomes that of a manager, guide, and observer, and the role of the student changes to that of a researcher of language.
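As a rough illustration of the kind of pattern students can look for, the Python sketch below counts the words that appear near a query word. The function name `collocates`, the `span` parameter, and the sample sentences are assumptions made for this example. Real concordancers use larger corpora and more careful statistics.

```python
import re
from collections import Counter

def collocates(text, keyword, span=2):
    """Count the words found within `span` tokens left or right of keyword.

    Tokenization is deliberately crude (lowercase letters and apostrophes),
    and the window ignores sentence boundaries.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for i, token in enumerate(tokens):
        if token == keyword.lower():
            window = tokens[max(0, i - span):i] + tokens[i + 1:i + 1 + span]
            counts.update(window)
    return counts

# Invented sample text standing in for a real corpus.
sample = (
    "She made a decision quickly. He made a mistake yesterday. "
    "They made a decision together. We made an effort."
)

for word, count in collocates(sample, "made").most_common(3):
    print(word, count)
```

From counts like these, a learner can notice that "decision" keeps turning up after "made" and arrive at "make a decision" without being told the rule.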
Why use this approach instead of traditional grammar and lexical instruction?
  • It exposes the language learner to authentic language instead of fabricated ESL text
  • It changes the role of the language learner from a merely receptive individual into a language researcher (note that this approach might not work as expected, especially with young learners)
  • It ensures a learner-centered classroom without diminishing the role of the teacher
  • It encourages learner autonomy with regard to error correction (to be discussed in my next post)
Later posts will say more about concordancing, data-driven learning, and corpora: how a teacher can collect a corpus for a particular learning context, how a teacher can analyze his/her learners’ linguistic output, such as writing (called learner error analysis), and how to use corpora in more classroom activities.
Now, I leave you with some links to concordance software, including web-based tools, that you can use and play around with:
  • AntConc: Laurence Anthony’s free concordance software that you can download
  • MonoConc Pro: commercial concordance software
  • Concordance: commercial concordance software that I use
  • Lextutor: a free web-based concordancer
  • Cobuild: a free web-based concordancer and corpus
The next post will discuss how to integrate a concordancer into a word processor to encourage error noticing and learner autonomy.
If you have any queries, need more info, or just want to give feedback, please post a comment. Your comments are highly welcome :)


