Similarity Comparison within words (Python)
I am attempting an ambitious project built around etymology within Germanic languages.
Coming from Holland I speak a germanic language and due to my obsession with the ancient past, I decided to attempt to find the words that bind all germanic languages.
The trouble is that many words have similarities but in different places.
For an example, the Dutch word for travelling is 'reizen' and the German equivalent is 'Reise' and also 'reise' in Norway. But other words have different spellings but similarities in other places.
What I want to ultimately do is type in a word or list of words, it will then look at other languages which use the same type of word and correctly display them, then later I can work at trying to come to a common root.
The trouble is letting the program correctly detect commonalities.
Does anyone have any ideas how I could accomplish this detection?
Take the travelling example.
Others can be a little more difficult. For instance, the Dutch for sailing is 'varen', which isnt similar to the German equivalent but the German for driving is 'fahren', an obvious common descendant.
My ultimate goal is to detect those.
Does anyone have any pointers?
Don't fear the terminal, it may look like a dragon, you just need to learn to ride it.