Do they just speak faster? Do the Indian words/pronunciation flow better/faster than English does? And they are simply trying to match the cadence?

  • ilinamorato@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    6 months ago

    Ok, so I heard a thing a long time ago about information density in languages, and that there’s a specific amount of information conveyed per second which is pretty consistent across languages, even when the number of sounds is higher or lower. Which means that a single word in English, for instance, would convey more information than a single word in Hindi.

    Is there anything to that? Or was that just nonsense?

    • merc@sh.itjust.works
      link
      fedilink
      arrow-up
      1
      ·
      6 months ago

      Someone posted a link to just that topic here. Apparently almost all languages transmit about 39 bits per second of data. Italians use 9 syllables per second, Germans only about 5-6, but both convey the same amount of information per second. But, not all syllables are equal. Japanese has about 5 bits per syllable, English has about 7 bits per syllable. The most information dense language per syllable is apparently Vietnamese with about 8 bits per syllable.

      Apparently though, the bottleneck is the brain. The end result seems to be that languages that have fewer “bits of data” per syllable say those syllables more quickly, and the ones with fewer bits of data per syllable say those syllables more slowly, so that the average is about 39 bits per second no matter what the language.

      Having said that, I often listen to podcasts sped up to 1.5x speed, and I listen to podcasts while doing other things, so I guess the bottleneck is probably on the sending side rather than the receiving side.

      • YTG123@feddit.ch
        link
        fedilink
        arrow-up
        0
        ·
        6 months ago

        Does anyone know how the amount of information is actually derived? The article just says “researchers calculated”

        • merc@sh.itjust.works
          link
          fedilink
          arrow-up
          0
          ·
          6 months ago

          They were vague about it, but they said something about converting it to computer code. I would guess they just wrote it out as ASCII text and counted how many bits of ASCII equivalent they transmitted. (Of course this ignores intonation and emphasis, but I’d guess they did ignore those.)

          • bleistift2@feddit.de
            link
            fedilink
            English
            arrow-up
            1
            ·
            5 months ago

            If that’s really what they did, it’s stupid. First, you need to find a translation for every language to ASCII, which will wildly skew the results. Second, there are many ways to express the same concept, which all vary wildly in length. Take “Hi”, 2 letters, which means exactly the same as “How are you doing?”, 14 letters.

            • merc@sh.itjust.works
              link
              fedilink
              arrow-up
              1
              ·
              5 months ago

              Take “Hi”, 2 letters, which means exactly the same as “How are you doing?”, 14 letters.

              It’s similar, but not exactly the same by any stretch. But, yeah, it’s not a perfect method. But, there probably isn’t a perfect method. How would you decide what “1 unit of information” is?

              • bleistift2@feddit.de
                link
                fedilink
                English
                arrow-up
                1
                ·
                5 months ago

                How would you decide what “1 unit of information” is?

                I wouldn’t, because I have no knowledge in the field. But since the paper hinges upon that exact definition, and “They were vague about it”, this raises the biggest red flag I’ve seen in science yet.