Transitional probability predicts native and non-native use of formulaic sequences

Randy Fred Appel, Pavel Trofimovich

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

Formulaic sequences (FSs), or prefabricated multi-word structures (e.g. on the other hand), are often difficult to identify objectively, and current corpus-driven methods yield structurally incomplete, overlapping, or overly extended structures of questionable psychological validity and pedagogical usefulness. To address these limitations, this study evaluated transitional probability as a potential metric to improve the identification of FSs by presenting 100 four-word sequences from the British National Corpus, varying in transitional probabilities between words, to native and non-native speakers of English (N = 293) in a sequence completion task (e.g. for the sake__). Results revealed that the application of transitional probability reduces many of the problems associated with current approaches to FS identification and can produce lists of FSs that are more functionally salient and psychologically valid.

Original languageEnglish
Pages (from-to)24-43
Number of pages20
JournalInternational Journal of Applied Linguistics (United Kingdom)
Volume27
Issue number1
DOIs
Publication statusPublished - 2017 Mar 1
Externally publishedYes

Fingerprint

Formulaic Sequences
Word Structure
Psychological
Incomplete
British National Corpus
Completion
Salient
Usefulness
Non-native Speakers of English

Keywords

  • corpus-driven research
  • formulaic language
  • formulaic sequences
  • lexical bundles
  • n-grams

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Cite this

Transitional probability predicts native and non-native use of formulaic sequences. / Appel, Randy Fred; Trofimovich, Pavel.

In: International Journal of Applied Linguistics (United Kingdom), Vol. 27, No. 1, 01.03.2017, p. 24-43.

Research output: Contribution to journalArticle

@article{9eddc472aaae4e36854aa1ff4beb48fe,
title = "Transitional probability predicts native and non-native use of formulaic sequences",
abstract = "Formulaic sequences (FSs), or prefabricated multi-word structures (e.g. on the other hand), are often difficult to identify objectively, and current corpus-driven methods yield structurally incomplete, overlapping, or overly extended structures of questionable psychological validity and pedagogical usefulness. To address these limitations, this study evaluated transitional probability as a potential metric to improve the identification of FSs by presenting 100 four-word sequences from the British National Corpus, varying in transitional probabilities between words, to native and non-native speakers of English (N = 293) in a sequence completion task (e.g. for the sake__). Results revealed that the application of transitional probability reduces many of the problems associated with current approaches to FS identification and can produce lists of FSs that are more functionally salient and psychologically valid.",
keywords = "corpus-driven research, formulaic language, formulaic sequences, lexical bundles, n-grams",
author = "Appel, {Randy Fred} and Pavel Trofimovich",
year = "2017",
month = "3",
day = "1",
doi = "10.1111/ijal.12100",
language = "English",
volume = "27",
pages = "24--43",
journal = "International Journal of Applied Linguistics (United Kingdom)",
issn = "0802-6106",
publisher = "Wiley-Blackwell",
number = "1",

}

TY - JOUR

T1 - Transitional probability predicts native and non-native use of formulaic sequences

AU - Appel, Randy Fred

AU - Trofimovich, Pavel

PY - 2017/3/1

Y1 - 2017/3/1

N2 - Formulaic sequences (FSs), or prefabricated multi-word structures (e.g. on the other hand), are often difficult to identify objectively, and current corpus-driven methods yield structurally incomplete, overlapping, or overly extended structures of questionable psychological validity and pedagogical usefulness. To address these limitations, this study evaluated transitional probability as a potential metric to improve the identification of FSs by presenting 100 four-word sequences from the British National Corpus, varying in transitional probabilities between words, to native and non-native speakers of English (N = 293) in a sequence completion task (e.g. for the sake__). Results revealed that the application of transitional probability reduces many of the problems associated with current approaches to FS identification and can produce lists of FSs that are more functionally salient and psychologically valid.

AB - Formulaic sequences (FSs), or prefabricated multi-word structures (e.g. on the other hand), are often difficult to identify objectively, and current corpus-driven methods yield structurally incomplete, overlapping, or overly extended structures of questionable psychological validity and pedagogical usefulness. To address these limitations, this study evaluated transitional probability as a potential metric to improve the identification of FSs by presenting 100 four-word sequences from the British National Corpus, varying in transitional probabilities between words, to native and non-native speakers of English (N = 293) in a sequence completion task (e.g. for the sake__). Results revealed that the application of transitional probability reduces many of the problems associated with current approaches to FS identification and can produce lists of FSs that are more functionally salient and psychologically valid.

KW - corpus-driven research

KW - formulaic language

KW - formulaic sequences

KW - lexical bundles

KW - n-grams

UR - http://www.scopus.com/inward/record.url?scp=85014954800&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85014954800&partnerID=8YFLogxK

U2 - 10.1111/ijal.12100

DO - 10.1111/ijal.12100

M3 - Article

AN - SCOPUS:85014954800

VL - 27

SP - 24

EP - 43

JO - International Journal of Applied Linguistics (United Kingdom)

JF - International Journal of Applied Linguistics (United Kingdom)

SN - 0802-6106

IS - 1

ER -