New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
using this library to detect arabic dialects #46
Comments
This is a very interesting idea. Is the spelling of words different from one Arabic dialect to another? |
yes of course |
I think the idea is sound and that it's worth trying. I don't think you would need to make any changes to this library in order to try it. If you run into any trouble generating profiles from your corpora, feel free to post here and I'll be glad to help. |
Sorry but i am new to java , |
Hi how are you |
The easiest way is to use the jar from shuyo's repository. Here's an example of generating an Egyptian Arabic profile from its Wikipedia abstract, using Linux:
Run this process to make sure it works. Then replace the abstract text file with your dialect corpus and run the last step again. |
@odaymard Any progress on this? |
I am using facebook api and twitterapi to get data but facebook4j is slow I am trying to make it faster |
Hi |
Nice! You need to (1) fork this project, (2) add your new profile to your fork, and (3) create a pull request to this project. |
I did that, what next? |
It's up to you. Some suggestions:
|
Nice work, thanks Oday and Robert. I believe that we should include Arabic dialect profiles in the library, and start some separation in profile loading. I suspect that most users who want "all" languages just want one Arabic profile, one Norwegian, one English, one German, not dialects. Dialects is special purpose. |
Hi @odaymard , @rmtheis , @fabiankessler , |
Hi @dansupiriti ,
first of all getting corpora depends on what language you want to add
i advice you to use graphapi to get corpora from facebook group and also
using twitter api is usefull
after getting corpora you have to generate language profile and test it
on your language
good luck
…On Mon, Mar 13, 2017 at 4:49 AM, dansupriti ***@***.***> wrote:
Hi @odaymard <https://github.com/odaymard> , @rmtheis
<https://github.com/rmtheis> , @fabiankessler
<https://github.com/fabiankessler> ,
I have started using the library and it's really helpful, but I might have
to add new language profile, could you please help me from where I can get
the language corpora and what are all steps involved to generate language
profile from language corpora? once I have new language profile how to add
the same in profile folder?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AP0vHBWBzu4jA7TSZ91ne-GMs2ncKRUDks5rlK6-gaJpZM4IJBB1>
.
|
Hi @odaymard , Regards, |
happy to help
…On 1 Apr 2017 01:11, "dansupriti" ***@***.***> wrote:
Hi @odaymard <https://github.com/odaymard> ,
Thanks for the suggestion. Now I am able to add new language profile as
per my requirement.
Regards,
Supriti
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#46 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AP0vHEuDG29dfloVS5oYO_WISUVHchjTks5rrXoIgaJpZM4IJBB1>
.
|
@odaymard Are you willing to publish the other Arabic dialect profiles that you've generated? Apart from the interests of others here, I would like to make a basic free Android app with the profiles you've made. It would be a simple, free app that allows a user to paste in Arabic text and get a dialect estimate based on this library. @safaahenno I think only the Syrian profile is available as of right now. |
@rmtheis Can I use this Lib in Netbeans project, and I get the steps to use this lib in Netbeans without facing problem like "package org.jetbrains.annotations does not exist" because it's not clear in readme file. |
@safaahenno You should open a separate issue for that. This issue pertains to Arabic dialects only. |
@rmtheis Ok, will do |
I am very happy to find this tool
i have a question
can this library help me in detecting arabic dialects (syrian iraqi gulf)
i will try to build corpora for each dialect and add it to language profile
is that right?
The text was updated successfully, but these errors were encountered: