Sections below describe how to get Lexicoder, the Lexicoder Sentiment Dictionary and some related topic dictionaries, Supporting Files useful for those learning how to use Lexicoder with R, and additional files and links useful for anyone downloading and reformatting content for use with Lexicoder, or other automated content-analytic software.

Lexicoder Software

Lexicoder 2.0 was freely available upon request to academic researchers from summer 2009 to summer 2015. As of August 2015, we have replaced the old software with Lexicoder 3.0. The new software is more reliable, it includes a greater number of functions, and it can deal with larger bodies of data. The interface is very different, however: 2.0 was accessible only through a basic Java-based interface; 3.0 is designed to be accessible through R, and from the Command Line. 

For the freely-available version of Lexicoder, please look over the user agreement, and then download the software. The Lexicoder manual and practise datasets are available separately, below.

Citation: Daku, M., Soroka, S., and Young, L. 2015. Lexicoder, version 3.0. Available at:

The Lexicoder Sentiment Dictionary

The LSD has been freely available upon request to academic researchers from fall 2011. As of August 2015, we have updated the format of the LSD so that it works with Lexicoder 3.0.

For the current version of the LSD, please look over the user agreement and then download the dictionary. Note that the dictionary is already included in quanteda in R.

Citation: Young, L. and Soroka, S. 2012. Lexicoder Sentiment Dictionary. Available at

A French-language version of the LSD is also available, produced by Duval and Petry, here

Note that the quanteda package in R has a very straightforward implementation of the LSD.  And the pre-processors to improve the performance of the LSD have been adapted for implementation in R, by Emily Luxon (University of Michigan at Dearborn), and can be downloaded as a single R script by clicking here.

Lexicoder Topic Dictionaries

We worked at one point on a series of topic dictionaries, aimed at capturing topics in news content, legislative debates, and policy documents. Our goal was to capture Major Topic codes from the comparative Policy Agendas project, in multiple languages. We developed preliminary dictionaries in English, Dutch, and Hebrew, in conjunction with the INFOPOL project.

We are not continuing with the development of these dictionaries, but they have served a useful starting point for researchers interested in capturing topics. The current versions of English and Dutch dictionaries can be downloaded below. If you may additions and revisions and would like to distribute those changes here, please do let us know.

Lexicoder Topic Dictionaries (English and Dutch, June 2013, Lexicoder 2.0 versions)

Citation: Albugh, Quinn, Julie Sevenans and Stuart Soroka. 2013. Lexicoder Topic Dictionaries, June 2013 versions, McGill University, Montreal, Canada. (available at

Supporting Files

Lexicoder 3.0 Users Manual

And example workflow in R. This zip file includes an R script file, sample dataset and dictionary files. It is intended to provide some simple examples of how to integrate Lexicoder into a R-based content analysis.

Additional Supporting Scripts, Software, Links

Simon Kiss has developed scripts that can be integrated in R (but needn’t be), designed to reformat media data downloaded from ProQuest archives. You can get the script at GitHub here.