-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cleaning code #19
Comments
https://github.com/oudalab/Arabic-NER/blob/master/explore_traingdata.ipynb it is at the end of this ipynb |
here is the command for transfer ontoNotes format to BILOU format
|
@khaled I will post the ontoNotes raw data to you tomorrow it is on my lab computer. |
@ahalterman Hi Andy do you still have the LDC raw data, I did not find it on my local, did not remember where I put it, we can give that to Khaled for him to take a look. |
Just sent you and Khaled a message. |
@khaled @ahalterman And after that I merge the tag into common ones, with the tag label both in anercorp and LDC ar_eval_all.json ar_eval_all_cleaned.json |
@YanLiang1102, FYI you're mentioning the wrong khaled - I have no connection with this project :-) |
@khaledJabr Hey Khaled I hope u saw the stuff, I mentioned a wrong Khaled, :P |
@YanLiang1102, can you post the code that produces
combined_cleaned_removed
(from exp 5)? Then @khaledJabr can take a look and we can make sure all the data's in the right/same format.The text was updated successfully, but these errors were encountered: