Author Topic: Korean-english Dictionary (ldic)  (Read 17335 times)

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #15 on: September 18, 2006, 05:56:55 am »
Quote
So I have an offer, if anyone can get the dictionary files from this site (they are for a program that is open source with code on the site):
http://kldp.net/frs/?group_id=73
into zbedic format, I will pay them $100 for their time....just a little incentive to maybe contribute something to the community.  I'd do it myself, but my C skills are kind of lacking....
Anyone who thinks they would be willing to take on the project, PM and we can work out the details...

It really isn't that difficult.

I have converted several Thai dictionaries to zbedic format and am in process of converting some more. I write a PERL script to convert from the original format to zbedic basic format and then use the programs supplied with bedic to make it into a dictionary file.

I use PERL to extract the different parts of the word definition and generate the output.

However, the difficult bit is that you need to know something about the language in order to make a decent quality conversion.
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #16 on: September 18, 2006, 06:10:26 am »
OK, I had a quick look at the dictionary file and the ldic source. It doesn't look difficult to extract the definitions from the dictionary. However, the original dictionary file looks like it might be in mark up but I can't quite tell if the coding is UTF-8 or something Korean specific.

I think it would be quite easy to hack a short program to load the dictionary and spew out bedic simple format but you'd need to know more about the character coding in order to complete the conversion.
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM

ttkman

  • Guest
Korean-english Dictionary (ldic)
« Reply #17 on: September 18, 2006, 08:53:56 am »
Quote
OK, I had a quick look at the dictionary file and the ldic source. It doesn't look difficult to extract the definitions from the dictionary. However, the original dictionary file looks like it might be in mark up but I can't quite tell if the coding is UTF-8 or something Korean specific.

I think it would be quite easy to hack a short program to load the dictionary and spew out bedic simple format but you'd need to know more about the character coding in order to complete the conversion.
[div align=\"right\"][a href=\"index.php?act=findpost&pid=141716\"][{POST_SNAPBACK}][/a][/div]

Dear koan,

as you might have read in the hole thread before, we know how to make that dic. The problem is just to extract it from the ldic-source. And that is problematic first related to the lack of C-knowledge by those who are interested in this dictionary. So if you could hack something together, do it  .. we will be quite thankful.

Btw. I made a zbedic-dic from that pdf I wrote about ... its not perfect yet, but quit usable ... if someone wants it ... send me a pm ... also I do have "other" korean-engl. dics ... just got them from a nice korean guy. But all together they are quit big and as I don't know, whether they are copyright-protected or not, I intend to not share them officially. So if someone is interested, send me a pm too ... we will work out a way.

greetings
Thomas

btw: I will be in Japan from WED this week, I don't know when I will be able to get a line ... so please be patient.

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #18 on: September 19, 2006, 08:56:11 am »
Quote
as you might have read in the hole thread before, we know how to make that dic. The problem is just to extract it from the ldic-source. And that is problematic first related to the lack of C-knowledge by those who are interested in this dictionary. So if you could hack something together, do it  .. we will be quite thankful.

I managed to compile the ldic program although I got a bit confused because I don't have a Korean font (couldn't see any output). Apart from that it looks OK, very basic GUI.

I'm thinking that extracting the dictionary info should be straightforward but putting it in a sensible bedic file might be tricky because I only know 2 phrases in Korean.

koan
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM

ttkman

  • Guest
Korean-english Dictionary (ldic)
« Reply #19 on: September 19, 2006, 09:29:18 am »
if we are able to extract the hole dic out of the ldic-program, we have a basic structure, then we should be able to use a bash or perl script to convert it ... but really, my C-skillz are sooo bad ... I would be glad if you perhaps just could review the ldic-code and change it to send the hole data to stdout ... I don't know if you have time or not to do this, or if your skills are good enough. Perhaps i will try it sometimes by myselfe, but right now I am bothered with learning korean and japanese, so I don't really have time to focus myselfe on that.

So ppl, please do something ...

thomas

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #20 on: September 20, 2006, 04:39:01 am »
Quote
So ppl, please do something ...

I'll have a go but it's not top priority for me - I'm converting 3 Thai dictionaries at the moment.

By the way, how big is the "small" dictionary already available from the bedic site ?

koan
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #21 on: September 30, 2006, 04:59:04 pm »
Hi guys

Here is a screenshot from the current status:

[img]http://www.lyndonhill.com/Zaurus/en-ko-dic.png\" border=\"0\" class=\"linked-image\" /]

Can someone tell me if it's somewhere near correct ?
This is a quick attempt to parse the file so it doesn't utilise all the
bedic features, hence some strange things like "2. 1." etc.

thanks

koan
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM

kurochka

  • Sr. Member
  • ****
  • Posts: 301
    • View Profile
Korean-english Dictionary (ldic)
« Reply #22 on: October 02, 2006, 02:14:29 pm »
ttkman,

Where can I find the dictionaries that you mentioned earlier?
SL-C3100 (from PriceJapan.com): modified Sharp Rom (couldn't make Japanese input work in Cacko Rom)

ex-SL-C3000; ex-SL-5600; ex-Simpad

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #23 on: October 16, 2006, 06:02:58 pm »
Check the previous post screenshot, it has been updated.
(Work in Progress)

cheers

koan
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM

kurochka

  • Sr. Member
  • ****
  • Posts: 301
    • View Profile
Korean-english Dictionary (ldic)
« Reply #24 on: October 17, 2006, 04:22:56 pm »
Quote
Hi guys

Here is a screenshot from the current status:

SL-C3100 (from PriceJapan.com): modified Sharp Rom (couldn't make Japanese input work in Cacko Rom)

ex-SL-C3000; ex-SL-5600; ex-Simpad

coklat

  • Newbie
  • *
  • Posts: 11
    • View Profile
    • http://
Korean-english Dictionary (ldic)
« Reply #25 on: October 19, 2006, 07:24:59 am »
Quote
Hi guys

Here is a screenshot from the current status:

 Where can I download the file? I would like to try it. Thanks
SL-C1000 owner, jumping around between Cacko, OZ & pdaXrom...
Most usable: Cacko, most fun: pdaXrom
My ideal Zaurus: Powerful like a notebook and easy to use like Palm PDA
Just a user though ;)

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #26 on: October 24, 2006, 02:17:31 pm »
Quote
Where can I download the file? I would like to try it. Thanks
[div align=\"right\"][a href=\"index.php?act=findpost&pid=144342\"][{POST_SNAPBACK}][/a][/div]

There are still many issues to fix that are not visible in the screenshot.

I am trying to sensibly separate the different sub-senses, parts of speech, categories etc. by developing a set of rules for the script that does the conversion. Also, I am trying to make the best conversion between the original format and bedic format.

Please understand, I think it is better to do a good job of the conversion rather than upload a half baked mess that gets distributed widely. Do it once, properly and everyone can use a good quality dictionary.

It may take a little bit of time but the wait will be worth it.

koan
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM

kurochka

  • Sr. Member
  • ****
  • Posts: 301
    • View Profile
Korean-english Dictionary (ldic)
« Reply #27 on: October 24, 2006, 03:54:56 pm »
Quote
There are still many issues to fix that are not visible in the screenshot.

I am trying to sensibly separate the different sub-senses, parts of speech, categories etc. by developing a set of rules for the script that does the conversion. Also, I am trying to make the best conversion between the original format and bedic format.

Please understand, I think it is better to do a good job of the conversion rather than upload a half baked mess that gets distributed widely. Do it once, properly and everyone can use a good quality dictionary.

It may take a little bit of time but the wait will be worth it.

koan
[div align=\"right\"][a href=\"index.php?act=findpost&pid=144705\"][{POST_SNAPBACK}][/a][/div]

That's the best approach.  Good luck.  

Do you think your script could be useful for other attempts to convert other formats into zbedic format?
« Last Edit: August 29, 2008, 05:32:44 pm by kurochka »
SL-C3100 (from PriceJapan.com): modified Sharp Rom (couldn't make Japanese input work in Cacko Rom)

ex-SL-C3000; ex-SL-5600; ex-Simpad

koan

  • Sr. Member
  • ****
  • Posts: 370
    • View Profile
    • http://www.lyndonhill.com
Korean-english Dictionary (ldic)
« Reply #28 on: January 02, 2007, 09:34:45 am »
Hi

paka and I managed to finish the conversion of these dictionary files.

If you are interested in downloading, please go to my Zaurus Dictionaries Page.

thanks

koan
Zocalo Feed Reader : Thai on Zaurus : Dictionaries for zbedic : Sharp ROM package feed
HELUX Handheld Embedded Linux Blog
SL-C3200 Multiboot : SL-C750  Sharp ROM