Page 1 of 1

Fix needed: Tonos and Oxia in Greek

PostPosted: Wed Mar 28, 2012 6:27 pm
by C2BlEv
Background:

When they created Greek basic and then Greek extended in Unicode, they duplicated vowels with the diacritic tonos (modern Greek) and oxia (Ancient Greek). Tonos and Oxia must look the same, as an acute accent (´, a small slanted dash above a vowel). Unicode requires that both precomposed tonos and oxia vowels must decompose into the vowel and a combining acute accent, conflating/normalizing the two to the tonos version.

Code: Select all
άέήίόύώ should equal άέήίόύώ
ΆΈΉΊΌΎΏ should equal ΆΈΉΊΌΎΏ


As the result, the typical rule is that the Unicode letters with oxia should not be used, instead the vowels with tonos should be used. However, some fonts (Tachoma, Lucida Grande, etc.) have erroneously described tonos as a small vertical dash, thus distinguishing between tonos and oxia. Given the existence of these fonts, most Ancient Greek documents use the oxia variant of the vowels.

Problem:


GoldenDict appears to normalize oxia vowels into tonos (for example, ἠγάπα becomes ἠγάπα; the difference is in ά) in the article view window. However, it does not conflate the two in the index algorithm (it should). For this reason, the links in articles become broken (the link containing oxia-normalized-to-tonos in the article does not lead to the word with oxia in the index). Moreover, GoldenDict shows identical words separately in the search/look-up window only because of the tonos-oxia difference. So, if there is a number of Greek dictionaries that use different tonos-oxia conventions their articles do not show together.

Fix:

GoldenDict should conflate/normalize all oxia vowels to their tonos equivalents everywhere: in the view window, the index and search. (One must not then use the broken fonts that incorrectly display tonos vowels).


Bibliography:
http://wiki.digitalclassicist.org/Greek ... ted_vowels
http://typophile.com/node/15933
http://www.tlg.uci.edu/~opoudjis/unicod ... kbkgd.html
http://www.unicode.org/charts/normaliza ... Greek.html (Unicode normalization chart)
http://omega.enstb.org/yannis/pdf/amendments2.pdf

Re: Fix needed: Tonos and Oxia in Greek

PostPosted: Wed Mar 28, 2012 9:36 pm
by ikm
Here's a question: if you type in the word with tonos, and the dictionary only contains oxia, will it show oxia matches in the list of matched words? And vice versa - when oxia typed, are tonos shown?

Re: Fix needed: Tonos and Oxia in Greek

PostPosted: Thu Mar 29, 2012 4:19 pm
by C2BlEv
In both scenarios, the articles with both tonos and oxia will be shown but separately (I think diacritics in Greek are ignored in look-up, which is the correct approach). However, the linking between articles is strict with the tonos-oxia distinction. A link with tonos (either originally with tonos or oxia-normalized-to-tonos) will not lead to articles with oxia.

Re: Fix needed: Tonos and Oxia in Greek

PostPosted: Tue Apr 17, 2012 6:14 pm
by jennie
Have you tried activating Greek transliteration (Edit->Dictionaries)? It includes some equivalents that are meant to fix (or at least alleviate) the tonos/oxeia problem, and so far it has worked for me.

Re: Fix needed: Tonos and Oxia in Greek

PostPosted: Sat Jul 06, 2013 6:53 pm
by Tvangeste
I think the problem of headwords normalization has just been fixed in the master branch. Please test and report your results.

No matter what user types in the search field (with tonos or oxia), he/she will find the article. And the links to headwords with tonos/oxia in DSL will work as well.