TY - CHAP
T1 - What can corpus software do?
AU - Anthony, Laurence
N1 - Publisher Copyright:
© 2022 selection and editorial matter, Anne O’Keeffe and Michael J. McCarthy; individual chapters, the contributors.
PY - 2022/1/1
Y1 - 2022/1/1
N2 - This chapter focuses on the software tools available to researchers interested in carrying out corpus studies. First, the chapter describes the strengths and weaknesses of ready-built online and offline tools and compares them to custom-built do-it-yourself (DIY) tools that usually come in the form of programming scripts. Next, the chapter explains how online, offline, and DIY tools can be effectively used to analyze bottom-up language patterns, such as the word and keyword frequencies, clusters, n-grams, lexical-bundle patterns, and Key-Word-In-Context (KWIC) concordances. Then, the chapter looks at how corpus tools can be used in combination with dedicated tagging and annotation tools to investigate top-down language patterns, including cohesion, register variation, discourse structure, and pragmatic phenomenon. Next, the chapter explains the importance of data interoperability in corpus tools, which allows for data to be imported into a tool and the results from that tool to be exported for use in other tools. Finally, the chapter discusses cases when a researcher might consider programming their own custom corpus tools and introduces several resources to help them create their first scripts.
AB - This chapter focuses on the software tools available to researchers interested in carrying out corpus studies. First, the chapter describes the strengths and weaknesses of ready-built online and offline tools and compares them to custom-built do-it-yourself (DIY) tools that usually come in the form of programming scripts. Next, the chapter explains how online, offline, and DIY tools can be effectively used to analyze bottom-up language patterns, such as the word and keyword frequencies, clusters, n-grams, lexical-bundle patterns, and Key-Word-In-Context (KWIC) concordances. Then, the chapter looks at how corpus tools can be used in combination with dedicated tagging and annotation tools to investigate top-down language patterns, including cohesion, register variation, discourse structure, and pragmatic phenomenon. Next, the chapter explains the importance of data interoperability in corpus tools, which allows for data to be imported into a tool and the results from that tool to be exported for use in other tools. Finally, the chapter discusses cases when a researcher might consider programming their own custom corpus tools and introduces several resources to help them create their first scripts.
UR - http://www.scopus.com/inward/record.url?scp=85140445523&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140445523&partnerID=8YFLogxK
U2 - 10.4324/9780367076399-9
DO - 10.4324/9780367076399-9
M3 - Chapter
AN - SCOPUS:85140445523
SN - 9780367076382
SP - 103
EP - 125
BT - The Routledge Handbook of Corpus Linguistics, Second edition
PB - Taylor and Francis
ER -