Here, as well as later in the article, you are working with groups of drastically different sample sizes: summing the table gives about 3M PRs from identified men and about 140k from women, so any one pull request is about 20x more likely to come from a man than from a woman.

Have you considered the potential impact of this imbalance on your statistical methods?
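
To make the concern concrete, here is a rough sketch (not taken from the article) of how the imbalance could show up in the uncertainty of a simple comparison of proportions. Only the approximate counts come from the table; the acceptance rate `p` is a hypothetical placeholder, and the binomial model is my assumption rather than the authors' method.

```python
import math

# Approximate counts quoted above (summed from the article's table).
n_men = 3_000_000
n_women = 140_000

# Hypothetical shared acceptance rate, for illustration only.
p = 0.75


def standard_error(p: float, n: int) -> float:
    """Standard error of a sample proportion under a binomial model."""
    return math.sqrt(p * (1 - p) / n)


ratio = n_men / n_women
se_men = standard_error(p, n_men)
se_women = standard_error(p, n_women)

# Standard error of the difference between two independent proportions.
se_diff = math.sqrt(se_men**2 + se_women**2)

# Fraction of the variance of the difference contributed by the smaller group.
women_variance_share = se_women**2 / se_diff**2

print(f"size ratio (men / women): {ratio:.1f}")
print(f"SE men: {se_men:.5f}, SE women: {se_women:.5f}")
print(f"variance share from the smaller group: {women_variance_share:.2f}")
```

Under these assumptions, nearly all of the uncertainty in the difference comes from the smaller group, so the precision of any comparison is set by the roughly 140k PRs rather than by the 3M.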
