Linguistics

592 readers
2 users here now

Welcome to the community about the science of human Language!

Everyone is welcome here: from laymen to professionals, Historical linguists to discourse analysts, structuralists to generativists.

Rules:

  1. Stay on-topic. Specially for more divisive subjects.
  2. Post sources whenever reasonable to do so.
  3. Avoid crack theories and pseudoscientific claims.
  4. Have fun!

Related communities:

founded 1 year ago
MODERATORS
1
 
 

Found this article in the longreads community arguing why "politically correct" terms shouldn't be used. You guys have any thoughts?

2
3
 
 

We invite participants to benchmark systems for word sense induction (WSI) across multiple languages. Unlike traditional approaches, this task evaluates WSI without relying on predefined sense inventories, offering a more theoretically plausible framework for understanding word meanings.

Participants will be provided with:

  • A set of polysemous target headwords.
  • Sentences containing these words in diverse contexts.

The goal is to cluster the sentences according to the sense in which the target word is used.

A different set of headwords and contexts will be provided for each of the following languages: English, Czech, German, Spanish, Estonian, Chinese

For each language, there will be approximately 25 headwords with 1500+ contexts each.

4
 
 

cross-posted from: https://quokk.au/post/1499265

What a Christmas present!

Italo-Celtic is a hypothetical branch of the Indo-European languages. If that branch is real, it means that the Italic and Celtic languages are closer to each other than to other Indo-European languages.

This hypothesis has been raised multiple times in the past, due to a few shared morphological features between Italic and Celtic languages; for example, the *-ism̥mo- superlative. But that's on its own weak evidence, so this genomic data makes wonders to reinforce this hypothesis.

And also to bury the competing (IMO rather silly) Italo-Germanic one.

Graeco-Armenian is similar to the above, but between the Hellenic languages and Armenian. There were lots of competing hypotheses "tying" both branches to other "random" Indo-European branches; for example I've seen Indo-Greek, Italo-Greek, Armeno-Germanic, Armeno-Albanian...

5
 
 

As W. Labov has passed away, I came across a comment reposting this screenshotted request, along with the paper in question:

https://betsysneller.github.io/pdfs/Labov1966-Rabbit.pdf

The paper is quite a rollercoaster, ranging from describing of disturbingly racist ideas about native Hawaiian and Black children that some scientists still pushed at the time (1970!*), to Labov's own disarmingly cute and humane solution to the issue of testing children's language abilities.

Edit: *1970 - according to the article itself, which is apparently based on Labov's 1970 talk; however, the URL suggests that the article was published in 1966, which is contradictory. I'll try to find out where and when this was actually published...

Edit 2: It looks like it is from 1970, from Working Papers in Communication, vol. 1 (Honolulu: Pacific Speech Association). It is surprising that a recently published book also claims that it's from 1966, probably the authors got the file from the same URL with the wrong year.

Edit 3: The original Twitter thread: https://xcancel.com/betsysneller/status/1516848959284678656

6
20
William Labov has passed away (languagelog.ldc.upenn.edu)
submitted 1 month ago by [email protected] to c/[email protected]
 
 

William Labov, known far and wide as one of the most influential linguists of the 20th and 21st centuries, passed away this morning at the age of 97, with his wife, Gillian Sankoff, by his side.

Bill is still very alive to us, so many of us, here at Penn. His voice reverberates. Mark is working on a longer, more detailed appreciation.

7
8
9
10
11
12
 
 

Archive link: https://archive.ph/cIz4A

It's dated to be from around 2400 BCE. The article doesn't clarify if it's a true alphabet or an abjad, but either way it's interesting.

EDIT: see also https://www.scientificamerican.com/article/worlds-oldest-alphabet-discovered/ for a less pop-linguistics narrative of the same discovery.

13
14
15
16
 
 

Native English speakers, how do you use personal datives? Today I came across an interesting text on the phenomenon here. Here are some examples from the text:

4] a. I got me some candy.

b. You got you some candy.

c. We got us some candy.

5] a. He got him some candy.

b. She got her some candy.

c. *It got it some candy.

d. They got them some candy.

(5c is marked with * to mark its grammatical unacceptability)

As a non-native speaker, I find sentences (4a) and (4c) to be natural, although I'd probably never use them myself. However, other sentences are odd to me, and seem as if they would cause confusion, they could be interpreted as if the subject got the candy for someone else. (4b), with 'you', is even more odd to my ears, even though a cited study says it is much more common than 3rd person constructions.

How do you perceive these sentences, are they all acceptable/natural to you?

17
18
15
submitted 3 months ago* (last edited 3 months ago) by [email protected] to c/[email protected]
 
 

The dispersal of the Indo-European language family from the third millennium BCE is thought to have dramatically altered Europe’s linguistic landscape. Many of the preexisting languages are assumed to have been lost, as Indo-European languages, including Greek, Latin, Celtic, Germanic, Baltic, Slavic and Armenian, dominate in much of Western Eurasia from historical times. To elucidate the linguistic encounters resulting from the Indo-Europeanization process, this volume evaluates the lexical evidence for prehistoric language contact in multiple Indo-European subgroups, at the same time taking a critical stance to approaches that have been applied to this problem in the past.

Part I: Introduction

Guus Kroonen: A methodological introduction to sub-Indo-European Europe

Part II: Northeastern and Eastern Europe

Anthony Jakob: Three pre-Balto-Slavic bird names, or: A more austere take on Oštir

Ranko Matasović: Proto-Slavic forest tree names: Substratum or Proto-Indo-European origin?

Part III: Western and Central Europe

Paulus S. van Sluis: Substrate alternations in Celtic

Anders Richardt Jørgensen: A bird name suffix *-anno- in Celtic and Gallo-Romance

David Stifter: Prehistoric layers of loanwords in Old Irish

Part IV: The Mediterranean

Andrew Wigman: A European substrate velar “suffix”

Cid Swanenvleugel: Prefixes in the Sardinian substrate

Lotte Meester: Substrate stratification: An argument against the unity of Pre-Greek

Guus Kroonen: For the nth time: The Pre-Greek νϑ-suffix revisited

Part V: Anatolia & the Caucasus

Rasmus Thorsø: Alternation of diphthong and monophthong in Armenian words of substrate origin

Zsolt Simon: Indo-European substrates: The problem of the Anatolian evidence

Peter Schrijver: East Caucasian perspectives on the origin of the word ‘camel’ and some notes on European substrate lexemes

19
 
 

I have a question for folks here, mainly around English linguistics but would love to hear of parallels in other languages. If you're not big on cats, just skip the next paragraph, which I've include for the context to be clear and show why I have provided the picture.

This morning, one of my cats was acting up a bit, hopping on the table where I have an electronics project, and searching for something to pilfer. In order to halt this behavior, I distracted him with a good deal of play with his toys (he is very athletic, so, lots of tossing a toy mouse for him to chase, then walking over to where he's left it because he doesn't fetch anymore). The image is of the culprit now that he's worn out.

While trying to achieve this state, I had a modified aphorism occur to me:

Idle cats are the Devil's playground.

It occurred to me then that I'm not sure if there is an extant term to describe taking an existing aphorism and modifying it while still conveying the same or similar meaning. For those not familiar, the original aphorism is "Idle hands are the Devil's playground" (apparently of biblical origin), meaning roughly that busy people don't often get into trouble or conversely that bored people will get into mischief.

There is a term, if informal, to describe, often intentional, mismatch of parts of aphorisms (ex. "Not the sharpest egg in the attic"), malaphor. Can anyone think of a similar extant term for a modified aphorism? If not, after trying multiple prefixes, I think that the least clunky seems to be "transaphor" (trans- meaning to change).

Anyone have thoughts on the matter?

20
21
 
 

cross-posted from: https://feddit.uk/post/14265279

The upward intonation, the guttural “ck” and even the cheeky comeback to win the argument: at just 19 months old, baby Orla has mastered the crucial elements of speaking like a scouser.

Impressively, the toddler who featured in a viral video this week appears to have done so without the need for actual words.

A clip posted on TikTok, and now viewed more than 20m times, shows Orla babbling in a Liverpudlian accent as her babysitter, Olayka, tries and fails to coax her into taking a nap. Scientists say that the cute exchange is also a vivid illustration of the processes by which babies acquire language – and the surprising role of accents.

Babies are so tuned in to the musical ups and downs of speech that even as newborns they cry in distinctive ways that reflect the languages that they have heard while in the womb.

In one 2009 study, Prof Kathleen Wermke, a pioneer in the field of speech development at the Würzburg University in Germany, found that French infants tend to wail on a rising note and German babies favour a falling melody and other patterns have been seen for Mandarin, Swedish and African languages. “When I started 40 years ago, if I told people I was recording babies crying and making high-pitched sounds they’d look at you and think ‘Is this really science?’,” she said.

22
23
 
 

I'm sharing this here mostly due to the alphabet. The relevant region (Tartessos) would be roughly what's today the western parts of Andalucia, plus the Algarve.

Here are the news in Spanish, for anyone interested.

The number of letters is specially relevant for me - 32 letters. The writing system is a redundant alphabet, where you use different graphemes for the stops, depending on the next vowel; and it was likely made for a language with five vowels, so you had five letters for /p/, five for /t/, five for /k/. Counting the "bare" vowels this yields 20 letters; /m n s r l/ fit well with that phonology, but what about the other seven?

24
25
view more: next ›