Abstract
AbstractThere have been many published picture corpora. However, more than half of the world’s population speaks more than one language and, as language and culture are intertwined, some of the items from a picture corpus designed for a given language in a particular culture may not fit another culture (with the same or different language). There is also an awareness that language research can gain from the study of bi-/multilingual individuals who are immersed in multilingual contexts that foster inter-language interactions. Consequently, we developed a relatively large corpus of pictures (663 nouns, 96 verbs) and collected normative data from multilingual speakers of Kannada (a southern Indian language) on two picture-related measures (name agreement, image agreement) and three word-related measures (familiarity, subjective frequency, age of acquisition), and report objective visual complexity and syllable count of the words. Naming labels were classified into words from the target language (i.e., Kannada), cognates (borrowed from/shared with another language), translation equivalents, and elaborations. The picture corpus had > 85% mean concept agreement with multiple acceptable names (1–7 naming labels) for each concept. The mean percentage name agreement for the modal name was > 70%, with H-statistics of 0.89 for nouns and 0.52 for verbs. We also analyse the variability of responses highlighting the influence of bi-/multilingualism on (picture) naming. The picture corpus is freely accessible to researchers and clinicians. It may be used for future standardization with other languages of similar cultural contexts, and relevant items can be used in languages from different cultures, following suitable standardization.
Funder
Indian Council of Medical Research
Manipal Academy of Higher Education, Manipal
Publisher
Springer Science and Business Media LLC