TY - CONF
T1 - Lower and higher estimates of the number of “true analogies” between sentences contained in a large multilingual corpus
AU - Lepage, Yves
N1 - Funding Information:
The research reported here was supported in part by a contract with the National Institute of Information and Communications Technology entitled “A study of speech dialogue translation technology based on a large corpus”.
Publisher Copyright:
© 2004 COLING 2004 - Proceedings of the 20th International Conference on Computational Linguistics. All rights reserved.
PY - 2004
Y1 - 2004
N2 - The reality of analogies between words is refuted by no one (e.g., I walked is to to walk as I laughed is to to laugh, noted I walked : to walk :: I laughed : to laugh). But computational linguists seem to be quite dubious about analogies between sentences: they would not be numerous enough to be of any use. We report experiments conducted on a multilingual corpus to estimate the number of analogies among the sentences that it contains. We give two estimates, a lower one and a higher one. As an analogy must be valid on the level of form as well as on the level of meaning, we relied on the idea that translation should preserve meaning to test for similar meanings.
UR - http://www.scopus.com/inward/record.url?scp=33847301011&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33847301011&partnerID=8YFLogxK
M3 - Paper
AN - SCOPUS:33847301011
T2 - 20th International Conference on Computational Linguistics, COLING 2004
Y2 - 23 August 2004 through 27 August 2004
ER -