Words with repeated sounds
One class of Japanese words that I especially like is the one where the first half is repeated, like しばしば or ゴロゴロ. I think they just sound very nice.
I am not sure whether there is a specific name for these words or whether they fall into specific subsets, but for this list I looked for all words with the above characteristic. The rule is relaxed in two ways:
- The repeated sound is often softened, for example in 人々 (ひとびと), where the second ひ is changed to び. Under the relaxed rule, the repeated part may begin with the dakuten (゛) or handakuten (゜) variant of the same character.
- Some words contain an additional character at the end, for example くれぐれも. So the second relaxed rule is that words are allowed one additional character at the end.
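To make the rule and its two relaxations concrete, here is a minimal sketch in Python of how such a check on a reading could look. It is not the code used to generate the list. The function names are made up for illustration, and it assumes readings are plain kana strings with precomposed characters (び as one character, not ひ plus a separate mark).

```python
import unicodedata

# Combining dakuten (U+3099) and handakuten (U+309A), as produced by NFD.
VOICING_MARKS = {"\u3099", "\u309a"}


def strip_voicing(ch: str) -> str:
    """Return the kana without dakuten/handakuten, e.g. び -> ひ, ぽ -> ほ."""
    decomposed = unicodedata.normalize("NFD", ch)
    return "".join(c for c in decomposed if c not in VOICING_MARKS)


def halves_match(first: str, second: str) -> bool:
    """True if `second` repeats `first`, allowing a softened first character."""
    if not first or len(first) != len(second):
        return False
    if first[1:] != second[1:]:
        return False
    # Relaxation 1: the repeated part may start with the (han)dakuten variant.
    return second[0] == first[0] or strip_voicing(second[0]) == first[0]


def is_repeated(reading: str) -> bool:
    """Check a kana reading against the rule and both relaxations."""
    reading = unicodedata.normalize("NFC", reading)
    candidates = [reading]
    # Relaxation 2: allow one additional character at the end.
    if len(reading) >= 3:
        candidates.append(reading[:-1])
    for cand in candidates:
        n = len(cand)
        if n >= 2 and n % 2 == 0 and halves_match(cand[: n // 2], cand[n // 2 :]):
            return True
    return False


for word in ["しばしば", "ゴロゴロ", "ひとびと", "くれぐれも", "たべる"]:
    print(word, is_repeated(word))
```

Because the check only looks at the reading, a word like こうこう passes as well, regardless of which kanji it is written with.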
I think that, where kanji are available, these words are generally written with a repeated kanji, like the above-mentioned 人々. Because the search looks only at readings, some words match by reading but use a different kanji for each part, for example 高校 (こうこう). I am not sure whether these should be considered a different type of word. At the same time, some words have no kanji (or at least none in the dataset), like ゴロゴロ. I want to include these, and to me the sound itself is what makes these words fun, which is why the rules are fairly relaxed.
Words in the list are sorted according to one frequency measure, with the more frequent ones coming first.
Click this link to see the list
Limitations
Very little processing is done when the list is generated, so there are some limitations to keep in mind.
First of all, this list was created by a non-linguist Japanese learner, based on things I found interesting or difficult while learning, and it might contain inaccuracies due to ignorance.
Similarly, the entries are sorted by frequency (according to the nfxx field of the associated entry), but this is neither the only possible order nor the most sophisticated one. It only influences the order of the list though, so it shouldn't cause any issues.
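As a rough illustration of the ordering, here is a small sketch in Python. It assumes the nfxx field follows the JMdict convention, where xx is the number of a 500-word block in a frequency ranking and nf01 is the most frequent block. The entry structure, the `tags` key and the tag values below are made up for the example and are not real frequency data.

```python
import re

# Matches tags such as "nf01" or "nf48"; the two digits are the block number.
NF_PATTERN = re.compile(r"nf(\d{2})")


def nf_rank(tags):
    """Return the lowest nf block number among the tags, or infinity if none."""
    ranks = [int(m.group(1)) for t in tags if (m := NF_PATTERN.fullmatch(t))]
    return min(ranks) if ranks else float("inf")


# Hypothetical entries; the tags are placeholders for illustration only.
entries = [
    {"reading": "しばしば", "tags": ["nf05"]},
    {"reading": "ゴロゴロ", "tags": []},
    {"reading": "ひとびと", "tags": ["nf02"]},
]

# A lower block number means more frequent, so it sorts first.
# Entries without an nf tag end up at the end of the list.
sorted_entries = sorted(entries, key=lambda e: nf_rank(e["tags"]))
print([e["reading"] for e in sorted_entries])
```

Since each block covers 500 words, entries within the same block are not distinguished by this measure and simply keep their original relative order.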