[div align=\"right\"]
[{POST_SNAPBACK}][/a][/div]
Paka,
Could you tell me more about the files for the Korean dictionary that you link to? Are those open source? If so, it might be possible to reformat the dictionary files into zbedic format and use them under zbedic. That would be great for Korean speakers (which I am not).
The dictionary files appear to have been compiled for use by ldic program. Can you find the source files for the files? Can they be decompiled?
[div align=\"right\"][a href=\"index.php?act=findpost&pid=137875\"][{POST_SNAPBACK}][/a][/div]
So I did a little research... it turns out that the dictionary data files for the ldic project most likely were in some way based on the engdic project (which was originally an open source eng-kor dictionary project that was later picked up by some computer/language/processing professor types from Japan who extended the dictionary, indexed it, and made it forwards and backwards useable - i.e. kor-eng and eng-kor). The old page for that project no longer exists, but I have a link to it on the web-archive here:
[a href=\"http://web.archive.org/web/20050308024704/http://www.kecl.ntt.co.jp/icl/mtg/resources/engdic/]http://web.archive.org/web/20050308024704/...sources/engdic/[/url]
So I tried to download the engdic4 files from the archive site, but no success, and no amount of searching - usenet, google, naver (korean google), or anythign would turn up the engdic4 files...
So I emailed the guy who was in charge of the updates to engdic a few years ago. What do you know, I get a reply from Japan (Im currently in Seoul btw) in less than 10 min. Just for kicks, here's some of it:
G'day,
I have been looking everywhere for this file - but it seems to have
vanished off of the web. Do you know where I might find it? Thank
you so much!
Unfortunately, I had some bad new about our Korean-English
dictionary, engdic. We received mail last year from Changwoo Ryu, one
of the debian Korean developers. He said that they now think that the
original distribution of engdic was possibly illegal:
But note that engdic's Korean-English dictionary data is
maybe-illegal-to-distribute. I said "maybe", because nobody knows how
this data were made. The original author just distributed it. And the
author was just a hacker and engdic is just a simple ad-hoc console app,
so everyone think the data came from one of the proprietary
dictionaries.
I don't want to risk distributing something illegal, so I had to take
the engdic site down. However, I think it is ok to share the subset
we annotated with Hanja, so I will attach it to this mail. Sorry I
can't be more help.
--
Francis Bond <www.kecl.ntt.co.jp/icl/mtg/members/bond/>
NTT Communication Science Laboratories | Natural Language Research Group
(I have that hanja file by the way if anyone wants it - just PM me)
SOOOOO....the long and short of it is that it looks like finding open source dictionary files this comprehensive is rather unlikely at this point.
So I am left with 2 options - compile ldic for my zaurus under pdaXqtrom (which is what I'm currently trying to do....but am getting hung up on gtk lib errors) or somehow extract the info from the ldic dictionary files, parse it and restructure it so it would play nice with zbedic.
So anyone who's interested is up to speed, here's the site wher eI am getting the files:
http://download.kldp.net/ldic/I'm not sure the difference between the '1.0.0' versions and the normal versions, and documentation in both korean and english is rather sparse....but the 1.0.0 versions seem to play more nicely when I open them in a text editor.
Of the two 1.0.0 files, the kor-eng side seems to open fine using Korean (not UTF8) encoding. It seems to be in html - indeed, it displays perfectly in a web browser when I add a html open and close tag to it. The eng-kor side, however, won't display everything correctly. I don't understand......
As the dictionary files are (uncompressed) ~24 MB and 66 MB text files, I'm inclined to try and compile ldic...but if anyone knows of a nice script to convert html to zbedic tags, please let me know by all means....
To anyone who made it all the way to the bottom of my rather esoteric post, I really appreciate your interest. Does anyone have any suggestions as to how I should proceed next?
Mahalo nui,
Naupaka