Studio 2022 is giving different analysis results to Studio 2021. Many 100% matches are dropping to 99%.

We have found that Studio 2022 is giving different analysis results to Studio 2021. We created two projects with exactly the same source files, template, TMs and project settings. We noticed only a slight difference in the total word count between the analysis reports from the two Studio versions, but the numbers of locked segments (100% matches), (cross-file) repetitions and fuzzy matches varied considerably. Studio 2022 gave many more fuzzy matches than Studio 2021 (especially 99% matches where we were expecting 100%).

How is it possible for such a difference to occur between the two versions of Studio?

    There are sometimes small changes between versions in the way the content of files is handled, and, less often, changes in the way the TM counts things.  So you need to help by providing us with some context and an explanation of where you see the changes.

    For example, and you could do this yourself... if you provided me with a file this is what I would do:

    1. scan the file and see if anything sticks out as an obvious place to look
    2. split the file and analyse both halves... identify which half is different, then split that and repeat... and repeat... until it becomes obvious where the difference is coming from
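
The splitting in step 2 is a bisection, and it can be sketched in a few lines of Python. This is only an illustration of the logic, not something Studio provides: `narrow_down` and `differs` are hypothetical names, and `differs` stands in for the manual step of analysing a chunk in both Studio versions and checking whether the reports disagree. It also assumes the difference can be pinned to a specific piece of content; if it only shows up when both halves are together (repetitions, for instance, depend on the surrounding segments), the sketch stops and returns the smallest chunk that still differs.

```python
from typing import Callable, List, Sequence


def narrow_down(segments: Sequence[str],
                differs: Callable[[List[str]], bool]) -> List[str]:
    """Repeatedly halve `segments` to find the smallest chunk that still differs.

    `differs(chunk)` is a hypothetical check: it should return True when
    analysing `chunk` in both versions gives different results.
    """
    chunk = list(segments)
    if not chunk or not differs(chunk):
        return []  # nothing to narrow down

    while len(chunk) > 1:
        mid = len(chunk) // 2
        first, second = chunk[:mid], chunk[mid:]
        if differs(first):
            chunk = first
        elif differs(second):
            chunk = second
        else:
            # Neither half differs on its own, so the difference depends on
            # the halves being analysed together. Stop here.
            break
    return chunk


# Stand-in predicate for illustration only: pretend segment "d" is the one
# that the two versions analyse differently.
sample = ["a", "b", "c", "d", "e"]
print(narrow_down(sample, lambda chunk: "d" in chunk))  # ['d']
```

Each round halves the amount of content left to inspect, so even a large file needs only a handful of split-and-analyse passes.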

    This difference could be file-specific and is most probably content-specific.  So to answer your question we'd need a lot more context than you have provided.  For us, it's probably not a bug (although this cannot be ruled out), so you need to do the work on this if you want to identify what's different, unless you have a support contract, in which case you should log a case and the support team will be able to help.

    Paul Filkin | RWS Group
