Original post: https://bsky.app/profile/rebeccamwrites.bsky.social/post/3m6a62btty226
Here’s the dataset and their website if you are interested, there’s more games beyond Final Fantasy that they’ve looked at.
Main findings
- 35% of words were spoken by female characters.
- 29% of characters were female, which suggests the imbalance is driven by a lack of female characters.
- 94% of games had more male dialogue than female dialogue.


This doesn’t seem right for X-2. All three party members are female, the villain for the first couple arcs (LeBlanc) is female. Just by glancing at the script there is not a whole lot of male dialogues
https://www.neoseeker.com/finalfantasyx2/faqs/69600-final-fantasy-x-2-script.html
Maybe Brother’s word count is double-counted? Brother speaks in Al Bhed and the subtitle would show both the Al Bhed and English translation
It looks like the scraper python script pulls the game script from here: https://www.ffcompendium.com/h/faqs/ffx2scriptaschthehated.txt
I’m not familiar with X-2 or their method for parsing the game script, but the project doesn’t look too hard to set up and you can submit issues on their GitHub so if you want to take a look I would say go for it. If they made a mistake I’m sure they would like to know
Looks like all Al Bhed dialogues might be double counted. The script there contains: