Coursera Knowledge Areas Extraction

Goal: assess which knowledge areas are taught (and under or over-represented) in online courses in Spanish language on the Coursera platform.

Step 1
WordCloud

Create a simple word cloud of words that make up the certificates titles. This gives a broad idea of the topics but, among other things, misses synonyms and does not account for topics made up of more than one word.

Step 2
Clean-up

Prepare vocabularies (ESCO, O*NET, some topics added manually)

Run Spacy Phrase Matcher

Step 3
Network graph...

... using Networkx and Pyvis.

Click here to open the graph.