Here is the link to the Grimm text: Grimm Fairytales, and the home page of Project Gutenberg.
If you would like to see our repository, which houses our code for this project, click here
We have pulled the most-used words in the following categories from the entire Grimm document: animals, colors, and numbers.
In the Grimm fairy tales, as in tales by other authors, certain "buzzwords" recur across multiple tales. These words fall into many categories, but our team found it most interesting and useful to focus on three distinct ones. Below, you will see three graphs, drawn in SVG, that display what we found. To build these graphs, we used RegEx in Oxygen to find and tag every use of a number, animal, or color. Once everything was tagged, we could count how many times each word appears in the document. From these counts, we wrote SVG code to output the graphs of our word analysis.
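The tag, count, and graph steps described above can be sketched in a few lines of Python. This is only an illustration, not the code we ran: our tagging was done with RegEx in Oxygen, and the word lists, tag names, and function names here (`CATEGORIES`, `tag_text`, `count_tags`, `bar_chart_svg`) are assumptions made up for the example.

```python
import re
from collections import Counter

# Hypothetical word lists; the lists actually used in the project are not shown here.
CATEGORIES = {
    "animal": ["wolf", "fox", "bird"],
    "color": ["red", "white", "gold"],
    "number": ["one", "three", "seven"],
}

# Map each word to its category so a single regex pass can tag everything.
WORD_TO_CATEGORY = {w: c for c, ws in CATEGORIES.items() for w in ws}
PATTERN = re.compile(
    r"\b(" + "|".join(map(re.escape, WORD_TO_CATEGORY)) + r")\b",
    re.IGNORECASE,
)


def tag_text(text):
    """Wrap every matched word in an XML-style tag named after its category."""
    def wrap(match):
        category = WORD_TO_CATEGORY[match.group(1).lower()]
        return f"<{category}>{match.group(1)}</{category}>"
    return PATTERN.sub(wrap, text)


def count_tags(tagged, category):
    """Count how often each word appears inside tags of the given category."""
    found = re.findall(rf"<{category}>(.*?)</{category}>", tagged)
    return Counter(word.lower() for word in found)


def bar_chart_svg(counts, bar_height=20, scale=10):
    """Return a minimal horizontal bar chart as an SVG string."""
    parts = []
    for i, (word, n) in enumerate(counts.most_common()):
        y = i * (bar_height + 5)
        parts.append(
            f'<rect x="80" y="{y}" width="{n * scale}" '
            f'height="{bar_height}" fill="steelblue"/>'
        )
        parts.append(f'<text x="0" y="{y + 15}">{word} ({n})</text>')
    height = len(counts) * (bar_height + 5)
    return (
        f'<svg xmlns="http://www.w3.org/2000/svg" width="400" height="{height}">'
        + "".join(parts)
        + "</svg>"
    )


sample = "The wolf met a red fox and the Wolf ran."
tagged = tag_text(sample)
animal_counts = count_tags(tagged, "animal")
svg = bar_chart_svg(animal_counts)
```

Tagging first and counting afterwards mirrors the workflow above: once the words are wrapped in tags, the counts come from the tags rather than from a second search of the raw text.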
We also compiled bar graphs for the top 10 verbs used in the document, and all verbs used more than 20 times.
In any type of literature, the verbs an author chooses matter. Verbs can set the tone of a piece, and even a bare list of the verbs in a document can give you an interesting look at the tone and content of the work.
Below are graphs of the top 20 verbs we found used in the Grimm tales, and also all verbs that were used 20 or more times.
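Choosing which verbs go into each graph comes down to two simple filters over a table of verb counts: a "top N" cut and a minimum-frequency cut. The sketch below assumes the counts are already available as a Python `Counter`. The counts are placeholders and not figures from the Grimm text, the helper names `top_n` and `at_least` are hypothetical, and treating the cutoff as inclusive is also an assumption.

```python
from collections import Counter

# Placeholder counts for illustration only; these are NOT the real Grimm figures.
verb_counts = Counter({"said": 50, "went": 30, "came": 20, "saw": 12, "ran": 5})


def top_n(counts, n):
    """Return the n most frequent (verb, count) pairs, highest count first."""
    return counts.most_common(n)


def at_least(counts, threshold):
    """Return every (verb, count) pair whose count meets the threshold (inclusive)."""
    return [(verb, c) for verb, c in counts.most_common() if c >= threshold]


top_three = top_n(verb_counts, 3)
frequent = at_least(verb_counts, 20)
```

Either list can then be handed to whatever code draws the bars, one bar per pair.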