Università di Bergamo > Cerlis > Versione Italiana
CADIS
Corpus of Academic English
Research project funded bythe Italian Ministry of Research.
As an aid in the analysis ofvariation in intercultural communication, and in the identification of textualvariants arising from the use of English as a first language, second language,or lingua franca of the scientific community, we have devised a corpus formedby English – and in part Italian –texts for academic communication,produced by scholars and academic institutions in various parts of the world.
CADIS can thus enable researchers toanalyse the most significant macro-microlinguistic variants in terms of identity, evaluation andinterpretation in the light of recent linguistic scholarship. More specifically,the data allows an in-depth analysis of the following aspects:
Besides includingtwo alternative languages and representing native as well as non-nativespeakers, CADIS also represents four different disciplinary areas:
For each disciplinaryarea, four different textual genres have been considered:
The structural complexity of CADIS reflectsits contrastive orientation: it is in fact designed to be internally comparable,so as to enable researchers to analyse and contrastthe chosen texts, not only by disciplinary area, genre, language and culture,but also historically. This is possible because the corpus covers a time frameof over 30 years, from 1980 to the present day.
The English texts were taken from morethan 30 peer-reviewed journals available by subscription through the Universityof Bergamo website. Because all the journals selected have a high impactfactor, we are confident that the content of our corpus is highly representativeof each specialised community from which itoriginated. The same principle is followed in the sampling of Italian academictexts, which have been selected from the most important journals available ineach field.
CADIS comprises 2761 academic texts,reaching a total of about 12 million tokens, selected and classified by disciplinaryarea, genre, language, author (i.e. NS/NNS), geographical provenance, date ofpublication and source journal.
Project leader:
Prof. Maurizio Gotti
Project members:
Dr. Ulisse Belotti
Dr. Larissa DAngelo
Dr. Davide Giannoni
Dr. Stefania Maci
Dr. Michele Sala
CADISCorpus (1980-2011)
The Subcorpora
Applied Linguistics (1980 – 1999) ENG
Applied Linguistics(2000 – 2011) ENG
AppliedLinguistics (1980 – 1999) ITA
AppliedLinguistics (2000 – 2011) ITA
Economics (1980 – 1999) ENG
Economics(2000 – 2011) ENG
Economics (1980 – 1999) ITA
Economics(2000 – 2011) ITA
Law (1980 – 1999) ENG
Law (2000 – 2011) ENG
Law (1980 – 1999) ITA
Law (2000 – 2011) ITA
Medicine (1980 – 1999) ENG
Medicine (2000 – 2011) ENG
Medicine (1980 – 1999) ITA
Medicine (2000 – 2011) ITA
Publications