AutoSuggest-Wörterbuch

Hi,

I need an AutoSuggest-Wörterbuch for German-Arabic. Can you tell me please, how I can get it?

I do not know how I can find it.

emoji
Parents Reply
  •   

    TM's are normally personalised with the material you translate.  So you would need to translate more!

    But if you desperately just want to build a larger TM and don't care about the quality or context being something you have personally checked yourself... in which case machine translation might be a better option anyway... then try getting a corpus from here: https://opus.nlpl.eu/

    The image shows a screenshot of a web page for searching and downloading language resources. It presents a table with rows for different corpora: "CCMatrix v1", "OpenSubtitles v2018", and "LinguaTools-WikiTitles v2014". The columns display various statistics for each corpus, such as the number of documents ("doc's"), sentences ("sent's"), tokens, and tokens delexicalised ("de tokens"), as well as the availability of XCES/XML format. Additionally, there are columns indicating the availability of different file types and tools associated with each corpus, including TMX (Translation Memory eXchange), Moses (a statistical machine translation system), mono (monolingual), raw, ud (universal dependencies), alg (word alignments), dic (dictionary), and freq (frequency lists). There is also a column for "other files" with links to "sample" files. The languages focused on are Arabic ("ar") and German ("de"). Users can select different language combinations and filter by size above the table. There are options for "show all versions" and to download the data in different formats indicated by mnemonics like "tmx", "moses", and "raw".

    Minimising the size... because you won't want the excessive sizes available here... you still have some massive corpus that you could take and then cut them down if you have a decent text editor so something that will get you the sufficient number of TUs you need.  I think if I was you and till thought this was an appropriate thing to do then I'd download one of these TMX files, cut it down so I only had a fraction of the number available, and then use that to get a start with AutoSuggest.  Your biggest problem, based on your comments so far, is likely to be just how to handle such large files in any text editor!!

    But be very sure this is really what you need!

    How large are your TMs so far?  Perhaps a better solution would be to simply machine translate some content that is relevant to the sort of work you do, post-edit it, and use that to get to 10000 translation units.  I imagine you could get there quite quickly.

    Paul Filkin | RWS Group

    ________________________
    Design your own training!

    You've done the courses and still need to go a little further, or still not clear? 
    Tell us what you need in our Community Solutions Hub

    emoji
Children
No Data