Programming for Corpus Linguistics with Python and Dataframes

English | Paperback / softback | Print on demand
Daniel Keller
Cambridge University Press
EAN: 9781108822589
Price: CZK 497 (list price CZK 552, 10% discount)

Detailed information

This Element offers intermediate or experienced programmers algorithms for corpus linguistics (CL) programming in Python using dataframes, which provide a fast, efficient, and intuitive set of methods for working with large, complex datasets such as corpora. It demonstrates principles of dataframe programming applied to CL analyses and presents complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is also presented, including methods for tokenizing, part-of-speech tagging, and lemmatizing with spaCy. Together, these provide a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.
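To give a rough sense of what a dataframe-based concordance might look like, here is a minimal sketch using pandas. It is not taken from the book: the function names `build_corpus` and `concordance`, the one-token-per-row layout, and the whitespace tokenizer (standing in for a real tokenizer such as spaCy's) are all assumptions made for illustration.

```python
import pandas as pd


def build_corpus(text: str) -> pd.DataFrame:
    """Build a toy dataframe corpus with one row per token.

    Assumption: lowercased whitespace splitting stands in for real tokenization.
    """
    return pd.DataFrame({"token": text.lower().split()})


def concordance(corpus: pd.DataFrame, node: str, window: int = 2) -> pd.DataFrame:
    """Return keyword-in-context (KWIC) rows for `node`.

    Context is assembled by shifting the token column, so no explicit
    Python loop over the tokens is needed.
    """
    tok = corpus["token"]
    left = pd.Series("", index=tok.index)
    right = pd.Series("", index=tok.index)
    # shift(k) places token i-k on row i, so larger k means further to the left.
    for k in range(window, 0, -1):
        left = left + " " + tok.shift(k, fill_value="")
    # shift(-k) places token i+k on row i, giving the right-hand context.
    for k in range(1, window + 1):
        right = right + " " + tok.shift(-k, fill_value="")
    kwic = pd.DataFrame(
        {"left": left.str.strip(), "node": tok, "right": right.str.strip()}
    )
    # Keep only the rows whose token matches the search word.
    return kwic[kwic["node"] == node].reset_index(drop=True)


corpus = build_corpus("the cat sat on the mat and the dog sat too")
kwic = concordance(corpus, "sat", window=2)
print(kwic)
```

Because the context columns are built with column-wise shifts rather than row-by-row iteration, the same approach should scale to much larger token tables, which is the kind of efficiency the description attributes to dataframes.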
EAN: 9781108822589
ISBN: 1108822584
Binding: Paperback / softback
Publisher: Cambridge University Press
Publication date: June 20, 2024
Pages: 114
Language: English
Dimensions: 229 x 152 x 6 mm
Country: United Kingdom
Author: Daniel Keller
Illustrations: Worked examples or exercises
Series: Elements in Corpus Linguistics