Bite 205. Female speakers @ Pycon US

After our Code Challenge 62 / Alicante PyDay last week, we thought it would be nice to branch off a Bite exercise using what we learned. So prepare to do some web scraping using BeautifulSoup and discover a new library called gender_guesser. We are going to look at the percentage of female speakers at Pycon US 2019.

Here is what you need to do:

  1. Complete get_pycon_speaker_first_names, extracting all speaker names from the PYCON_HTML page we cached for you. Note that some entries contain multiple names separated by a comma (,) or a slash (/), so you will need to split those into individual names. Return a list of first names.
  2. Complete get_percentage_of_female_speakers using gender_guesser.detector's Detector() to determine the gender for each of the first names passed in. This tool is not perfect: some names won't be found. However, we like the Pareto principle, so we're happy to get a rough indication. Return the percentage of female speakers rounded to 2 decimal places.
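
The two steps above could be sketched roughly as follows. This is not the reference solution, just one possible shape under stated assumptions: the speaker entries are taken to be plain strings already pulled out of PYCON_HTML (the page markup is not shown here, so the BeautifulSoup extraction is left out), the helper names are hypothetical, and the gender lookup is passed in as a function so the sketch runs without gender_guesser installed. With the real library, the lookup would be Detector().get_gender from gender_guesser.detector. Counting "mostly_female" as female and dividing by all names, including unrecognised ones, are both assumptions; the Bite's tests may expect something different.

```python
import re
from typing import Callable, Iterable, List


def split_speaker_entry(entry: str) -> List[str]:
    """Split one speaker entry on commas and slashes into full names."""
    parts = re.split(r"[,/]", entry)
    return [part.strip() for part in parts if part.strip()]


def first_names(entries: Iterable[str]) -> List[str]:
    """Return the first word of every full name found in the entries."""
    names = []
    for entry in entries:
        for full_name in split_speaker_entry(entry):
            names.append(full_name.split()[0])
    return names


def percentage_female(names: Iterable[str], guess: Callable[[str], str]) -> float:
    """Percentage of names the guesser labels female, rounded to 2 places.

    Assumptions: 'female' and 'mostly_female' both count as female, and
    the denominator is every name, including ones the guesser cannot place.
    """
    names = list(names)
    if not names:
        return 0.0
    female = sum(1 for name in names if guess(name) in ("female", "mostly_female"))
    return round(100 * female / len(names), 2)
```

For example, first_names(["Maria Sample, John Doe", "Pat Smith / Lisa Ray"]) splits the two made-up entries into four people and keeps only their first names. Passing the lookup in as an argument also makes it easy to test the percentage logic with a small dictionary before switching to the real detector.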

If next year's Pycon site doesn't change much, you now have a re-usable script you can run against Pycon 2020's data ...

Have fun and keep calm and code in Python!

Intermediate level
Bitecoin 3X

89 out of 89 users completed this Bite.
Resolution time: ~69 min. (avg. submissions of 5-240 min.)
Pythonistas rate this Bite 4.38 on a 1-10 difficulty scale.

We use Python 3.8