I had a quick look at the manifestos of the main parties today, so I thought I’d jot down a few remarks here.
So the first thing I did was to remove all the stop words and then run a frequency distribution across the remaining text, which yielded the following result:
That done I ran a quick trigram colocation across the text. This finds groups of three words, which have a low probability of being next to each other by accident, or just by the nature of the English language. Having found groupings of words, I then took a frequency distribution over them and found the most frequent three words groups, this can help us get a good feel for what ideas are important to the authors of the manifestos. The results are below:
That’s it for now, I’ll post again if I get time to do a little more analysis.