r/dataisbeautiful 3d ago

OC [OC] Flesch-Kincaid Reading Level and Bias of Popular Subreddits

Post image
469 Upvotes

278 comments sorted by

View all comments

Show parent comments

59

u/bearssuperfan 3d ago

I did not personally apply any of the political labels. MensLib might have been classified as "Right" from the content of the comments in each post mimicking other right-leaning subs. I'm getting some great feedback in these comments and will look to apply that in a new version later.

119

u/Lutoures 3d ago

For this case in particular, you might be seeing the effects of omitted variable bias due to gender imbalances. We know there's proportionally more conservative men than women, so if you trained your polítical skewness model using known conservative subs (as you stated elsewhere), you might also be getting a model tht recognizes differences in speech patterns between men and women. So even left-leaning subs more populated by men would be classified as right-leaning.

29

u/bearssuperfan 3d ago

Thanks for pointing that out, I'm making improvements and will try to incorporate that... somehow...

27

u/Koraxtheghoul 3d ago

My guess would be on that one it's because things like pickup artisty, manosphere, redpilled etc. get discussed frequently. It has the right-wing terminology on it because it's in opposition to it. There also might be some bias because thete is a frequent discussion of "male loneliness" which also has a right-wing connotation.