We need to distinguish between two cases:
1. Automated tests generate a list of suspicious pairs.
2. A pair is investigated.
For case 1, where an automated test has flagged a pair as possibly cheating, more detailed tests should be run.
For case 2, if a pair is investigated and is guilty, but the tests indicate no cheating, then the tests have a problem: a false negative. For example, if a pair cheats only minimally, the cheating is difficult both to detect and to prove.
So far, I have not encountered any such cases, but some should be expected. This is a mild concern, since it allows cheating players to "get away with it". However, it is a much lesser concern than the next scenario.
What can be done to reduce the number of false positives?
There is no single test for detecting cheating in Bridge; there are multiple tests. If some tests indicate "no cheating" but others indicate "cheating", then a more detailed analysis of the boards may be needed.
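As a rough illustration of combining several tests, the following Python sketch sorts a pair's results into three outcomes. The test names, the Verdict and Triage types, and the triage function are all hypothetical and are not the actual tests. Only the "mixed" outcome, where disagreement leads to a detailed board review, comes from the text above; how to treat unanimous results is an assumption.

```python
from enum import Enum
from typing import Mapping


class Verdict(Enum):
    NO_CHEATING = "no cheating"
    CHEATING = "cheating"


class Triage(Enum):
    ALL_CLEAR = "no test flagged the pair"            # assumption: no further action
    ALL_FLAGGED = "every test flagged the pair"       # assumption: handled separately
    MIXED = "tests disagree: review the boards in detail"


def triage(results: Mapping[str, Verdict]) -> Triage:
    """Combine per-test verdicts for one pair into a single triage outcome."""
    if not results:
        raise ValueError("at least one test result is required")
    flagged = sum(1 for v in results.values() if v is Verdict.CHEATING)
    if flagged == 0:
        return Triage.ALL_CLEAR
    if flagged == len(results):
        return Triage.ALL_FLAGGED
    # Some tests say "cheating" and some say "no cheating".
    return Triage.MIXED


# Hypothetical test names, for illustration only.
example = {
    "opening_leads": Verdict.CHEATING,
    "defensive_play": Verdict.NO_CHEATING,
}
print(triage(example).value)
```

Keeping the mixed case as its own outcome means a single test flagging a pair leads to a closer look at the boards rather than directly to an accusation, which is one way to limit false positives.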