I have a pandas data frame where one of the columns holds a list of keywords. One row in the data frame looks like this:
id, jobtitle, company, url, keywords
1, Software Engineer, Facebook, http://xx.xx, [javascript, java, python]
However, the number of keywords in a row can range from 1 to 40.
I would like to do some data analysis, such as:
- what keyword appears most often across the whole dataset
- what keywords appear most often for each job title/company
Apart from giving each keyword its own column and dealing with lots of NaN values, is there an easy way to answer these questions with Python (preferably pandas, as it's a data frame)?
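
To make the setup reproducible, here is a minimal sketch of the frame, assuming the keywords column holds plain Python lists. The second row (including the company name "ExampleCorp") is invented purely for illustration. The last few lines show one possible direction, `DataFrame.explode` (available in pandas 0.25 and later), which produces one row per keyword rather than one column per keyword; whether that is the best approach is part of what I am asking.

```python
import pandas as pd

# First row is the sample from above; the second row is hypothetical.
df = pd.DataFrame({
    "id": [1, 2],
    "jobtitle": ["Software Engineer", "Software Engineer"],
    "company": ["Facebook", "ExampleCorp"],
    "url": ["http://xx.xx", "http://xx.xx"],
    "keywords": [["javascript", "java", "python"], ["python", "sql"]],
})

# One row per (original row, keyword) pair; other columns are repeated.
long_df = df.explode("keywords")

# Keyword frequency across the whole dataset.
overall = long_df["keywords"].value_counts()

# Keyword frequency within each job title and within each company.
per_title = long_df.groupby("jobtitle")["keywords"].value_counts()
per_company = long_df.groupby("company")["keywords"].value_counts()

print(overall)
print(per_title)
print(per_company)
```

Rows with differing numbers of keywords simply contribute differing numbers of rows to `long_df`, so no NaN padding is involved in this sketch.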