Worldwide: Embracing E-Discovery In Antitrust Matters: Slow But Steady Progress Toward Convergence Between The U.S. And The UK?

Last Updated: April 4 2016
Article by Ryan C. Thomas

Lawyers are sometimes risk averse and slow to change. Although this tends to lead to a more cautious approach to embracing new technologies, including the use of artificial intelligence, the increasing burden of e-discovery has forced the issue. Lawyers on both sides of matters are increasingly embracing a technology known as "predictive coding" to identify responsive and nonresponsive documents in private litigation and government investigations. While the United States is on the leading edge of this trend, other jurisdictions, including the United Kingdom, have been slower to follow suit, particularly in antitrust matters.

This is a timely and important issue. Recent research shows that nearly half of the cases requiring UK electronic corporate data to be processed were either in preparation for, or in response to, UK or foreign antitrust and regulatory matters. This dynamic has led to predictions that lawyers in the UK (and elsewhere) will make greater use of artificial intelligence in the near future.

The U.S. experience is illustrative. The two federal antitrust agencies—the U.S. Department of Justice ("DOJ") and Federal Trade Commission ("FTC")—have agreed with parties that predictive coding is useful to cull large volumes of electronically stored information in antitrust investigations. By contrast, there has not been any clear statement on this subject from the UK's Competition and Markets Authority ("CMA"), the UK sector regulators, or the courts. That changed in February 2016, when the High Court of England and Wales for the first time endorsed the use of predictive coding in the UK, relying in large part on judicial acceptance of the technology in the U.S. (Case No: HC-2014-000038, Pyrrho Investments Limited and another v MWB Property Limited, [2016] EWHC 256 (Ch)).

This Commentary discusses the latest trends in the use of predictive coding in U.S. and UK antitrust matters, and how Pyrrho is likely to spur slow but steady progress toward greater acceptance in the UK.

What Is Predictive Coding?

Use of e-discovery tools to alleviate the burdens associated with document-intensive matters is not new. Since the mid-1980s, private litigants have agreed to use keyword searches, "concept-based" searches, and most recently predictive coding as alternatives to manual document-by-document ("linear") review. Generally speaking, predictive coding is a form of artificial intelligence that uses human reviewers' examination of a subset of documents (so-called "seed documents") to "train" computer algorithms to review and "predict" what other documents are responsive. Nowadays, the term "predictive coding" is used interchangeably with "technology-assisted review" ("TAR"), "computer-assisted review," or simply "assisted review."

How Does It Work in Practice?

There are a number of different software platforms capable of performing the necessary analytics for predictive coding. Lawyers work with e-discovery vendors to understand the capabilities of the predictive coding software to ensure that the document population is handled appropriately. For example, some predictive coding models cannot categorize certain file types, which would need to undergo a linear review. However, other predictive coding software platforms do not have the same limitations. In terms of training protocols, there are two broad categories of how predictive coding models can be trained. In "passive learning" protocols, the model is trained by evaluating multiple sets of random samples of documents coded by attorney reviewers. In "active learning" protocols, the computer helps select certain "borderline" documents for attorneys to review to further refine the model more efficiently than entirely random document sets would.
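The difference between the two training protocols can be illustrated with a minimal sketch. It is not drawn from any particular vendor's platform: the function names, the document identifiers, and the scores are hypothetical, and "borderline" is assumed here to mean a predicted responsiveness score close to 0.5 (a common approach known as uncertainty sampling).

```python
import random

def passive_batch(doc_ids, batch_size, rng):
    """Passive learning: draw the next training batch uniformly at random."""
    return rng.sample(doc_ids, batch_size)

def active_batch(scores, batch_size):
    """Active learning (uncertainty sampling): pick the documents whose
    predicted score is closest to 0.5, i.e. the 'borderline' documents."""
    ranked = sorted(scores, key=lambda doc_id: abs(scores[doc_id] - 0.5))
    return ranked[:batch_size]

# Hypothetical model output: estimated probability that each document
# is responsive. Real platforms compute these scores internally.
scores = {"doc-a": 0.97, "doc-b": 0.52, "doc-c": 0.08, "doc-d": 0.45, "doc-e": 0.71}

borderline = active_batch(scores, 2)                               # doc-b and doc-d
random_pick = passive_batch(sorted(scores), 2, random.Random(0))   # any two documents
```

In the active protocol, attorney time is spent on the documents the model is least sure about, which is why it can refine the model with fewer reviewed documents than purely random batches.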

While the various software platforms may employ assorted processes and have varied limitations, a key objective across all of them is the ultimate "recall rate"— i.e., the percentage of relevant documents ultimately discovered—that is validated by the nonresponsive sample results. An agreed-upon recall rate allows litigants, merging parties, and government agencies to vet the effectiveness of the predictive coding platform regardless of the software used.
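One common way to validate recall from a sample of nonresponsive documents can be sketched as follows. This is an illustration under stated assumptions, not the method prescribed by any agency or court: the function name and all figures are hypothetical, and the estimate assumes the sample is drawn at random from the documents coded nonresponsive.

```python
def estimate_recall(responsive_found, nonresponsive_total,
                    sample_size, responsive_in_sample):
    """Estimate recall from a random sample of the nonresponsive set.

    The share of responsive documents found in the sample is projected
    onto the whole nonresponsive set to estimate how many responsive
    documents were missed.
    """
    if sample_size <= 0:
        raise ValueError("sample_size must be positive")
    miss_rate = responsive_in_sample / sample_size
    estimated_missed = miss_rate * nonresponsive_total
    estimated_total = responsive_found + estimated_missed
    return responsive_found / estimated_total if estimated_total else 0.0

# Hypothetical figures: 30,000 documents coded responsive, 900,000 coded
# nonresponsive, and 15 responsive documents found in a sample of 1,500.
recall = estimate_recall(30_000, 900_000, 1_500, 15)
```

With these hypothetical figures, an estimated 9,000 responsive documents remain in the nonresponsive set, giving an estimated recall of roughly 77 percent. Because the figure rests on a sample, it is an estimate with its own margin of error, not an exact count.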

Use of Predictive Coding in U.S. Antitrust Merger Investigations

In the U.S., the use of predictive coding is becoming standard practice in response to the significant compulsory document requests ("Second Requests") issued by the federal antitrust agencies to parties in antitrust merger investigations. Increasingly, law firms are engaging with the DOJ and FTC on behalf of their clients to use predictive coding to identify responsive documents. Employing this type of technology is becoming necessary to handle the growing volume of emails and other electronically stored information that companies generate and to comply with often stringent time limits dictated either by the merger review process or the deal timetable (or both).

Take, for example, the DOJ and FTC processes. The DOJ has amended its "Model Second Request" to require merging parties to disclose and discuss any "software or technology used to identify or eliminate potentially responsive documents and information produced in response to this Request, including ... predictive coding." If a merging party chooses to use predictive coding, the DOJ and the party typically will agree to a certain recall rate and the opportunity for the DOJ to review statistically significant samples of (nonprivileged) nonresponsive documents to verify the agreed-upon recall rate. The DOJ has generally accepted a 75 percent recall rate with at least a 90 percent confidence level, which acknowledges that no review will be perfect and that approximately 25 percent of the responsive documents will not be produced.
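To give a sense of what a 90 percent confidence level can mean for the size of a validation sample, the sketch below applies the standard normal-approximation formula for estimating a proportion. The margin of error and the conservative 50 percent planning value are assumptions made for illustration; the DOJ figures quoted above specify only the recall rate and the confidence level, and the function name is hypothetical.

```python
import math
from statistics import NormalDist

def sample_size(confidence, margin_of_error, expected_rate=0.5):
    """Documents to sample so that an estimated proportion falls within
    +/- margin_of_error at the given confidence level (normal approximation).

    expected_rate=0.5 is the most conservative planning value.
    """
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # two-sided z-score
    n = (z ** 2) * expected_rate * (1 - expected_rate) / margin_of_error ** 2
    return math.ceil(n)

# Assumed margin of error of 5 percentage points (not stated in the article).
n_90 = sample_size(0.90, 0.05)
n_95 = sample_size(0.95, 0.05)
```

Under these assumptions, a 90 percent confidence level calls for a sample of 271 documents, and a 95 percent level for 385. Tighter margins of error increase the sample size quickly, which is one reason the parameters are worth settling with agency staff at the outset.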

The FTC's position is largely the same. Citing the widespread use of electronic materials and the need to improve the efficiency of its investigations when proposing changes to its rules of procedures, the FTC has stated, "Document discovery today is markedly different than it was only a decade ago.... Searches, identification, and collection all require special skills and, if done properly, may utilize one or more search tools such as advanced key word searches, Boolean connectors, Bayesian logic, concept searches, predictive coding, and other advanced analytics." Accordingly, in August 2015, the FTC also amended its Second Request Model to include instructions on the use of predictive coding.

This embrace of new technology by the DOJ and FTC is encouraging, as a growing body of evidence has demonstrated that linear reviews, in which attorneys review each document one-by-one for responsiveness, are less accurate and generate recall rates well below 75 percent.

Although the U.S. antitrust agencies are leading the way in accepting the use of predictive coding in antitrust matters, the agencies' electronic discovery negotiations are not internally consistent, and the path can sometimes be challenging, depending on the agency staff assigned to the matter. If not reasonably managed (on both sides), discussions about the e-discovery process can last weeks, elevating process over substance, delaying forward progress on the merits of the investigation. Some agency staff may attempt to evaluate how the software is performing before the model is fully trained and to set detailed parameters for the process that are not always or obviously consistent with best practices. As one of the leading U.S. judicial voices in e-discovery, Magistrate Judge Peck, cautioned, "one point must be stressed—it is inappropriate to hold TAR to a higher standard than keywords or manual review. Doing so discourages parties from using TAR for fear of spending more in motion practice than the savings from TAR for review."

The U.S. merger review process puts the burden on the merging parties to certify that they "substantially complied" with a Second Request, and, in such a context, the merging parties should retain broad discretion to use the method they reasonably believe to be appropriate, proportionate, and effective in order to satisfy their duty to comply. The agency staff can evaluate the sufficiency of the documentary response by verifying the agreed-upon recall rate through the review of samples of nonresponsive documents. As Judge Peck explained, "requesting parties can insure that training and review was done appropriately by other means, such as statistical estimation of recall at the conclusion of the review as well as by whether there are gaps in the production, and quality control review of samples from the documents categorized as nonresponsive."

UK Comparison

In the UK, predictive coding has been used in litigation and antitrust matters, but not as often as in the U.S. For example, in a high-profile alleged price-fixing conspiracy investigation some years ago, the CMA's predecessor, the Office of Fair Trading, agreed to the Jones Day antitrust team's request to use predictive coding to identify a few thousand responsive documents from an original collection dataset of several million documents, with a recall rate and confidence level similar to those accepted by the DOJ in "Second Request" merger investigations. Despite some experience with the technology, there continues to be no formal statement or guidance from the CMA or other UK sector regulators. This can perhaps be attributed to two reasons: UK antitrust matters, particularly merger reviews, have tended to be less document-intensive than U.S.-style "Second Requests," and there has been a historic reluctance by the English courts (and consequently, lawyers and government agencies) to endorse the use of predictive coding. But things are changing.

First, since April 2014, the CMA's investigatory powers have been strengthened, in particular through a wider power to require parties to produce documents at all stages of an investigation, including during first-phase merger reviews and market studies, and the ability to impose significant financial penalties on parties that fail to comply with an information request notice without a reasonable excuse. UK sector regulators have similar powers.

Second, and crucially, in February 2016, the High Court of England and Wales officially approved the use of predictive coding for the first time in the UK in Pyrrho.

Pyrrho Ruling

This case concerned compensation claims from the shareholders of a company on various grounds, including breach of fiduciary duties. The court ordered the disclosure of all relevant documents. Initially, the total number of electronic files was "more than 17.6 million." After de-duplication, the total was narrowed to 3.1 million documents, which the court observed was "still a large and costly number to search." The parties turned to predictive coding to expedite the review and asked the court to approve this approach. The court noted that "there is not a great deal by way of guidance, and nothing by way of authority, on the use of such software as part of the disclosure process." Such a lack of authority prompted the court to analyze other jurisdictions and to draw comparisons with the well-known U.S. district court decision of Da Silva Moore in which Judge Peck endorsed predictive coding for the first time in judicial proceedings (see our previous Antitrust Alert on the topic).

In approving the use of predictive coding in Pyrrho, Master Matthews listed the following 10 reasons in favor of the use of predictive coding and found "no factors of any weight pointing in the opposite direction":

  • Other jurisdictions have found that predictive coding software is useful in appropriate cases, notably the U.S. (Da Silva Moore).
  • There is no evidence that predictive coding is less accurate than linear review (and indeed, there is evidence that it is more accurate).
  • There is greater consistency in using predictive coding over "dozens, perhaps hundreds, of lower-grade fee-earners, each seeking independently to apply the relevant criteria."
  • There are no prohibitions on the use of predictive coding in the applicable rules of procedure.
  • The number of electronic documents to be reviewed in this case was "huge, over 3 million."
  • The cost of manually searching these documents would be "enormous, amounting to several million pounds at least." The court even goes further to describe a manual review of each document as "unreasonable" where a "suitable automated alternative exists at lower cost."
  • The costs of using predictive coding would be a fraction of the cost of manual review.
  • The value of the claims made in the litigation is in the tens of millions, making the estimated cost of predictive coding proportionate.
  • If the predictive coding is unsatisfactory, there will still be time to consider alternative methods.
  • The parties have agreed on the use of the software and a protocol.

In his closing remarks, Master Matthews noted that the agreed protocol was case specific: "Whether it would be right for approval to be given in other cases will, of course, depend upon the particular circumstances obtaining in them."


Pyrrho was not an antitrust case, but it is nonetheless instructive for the use of predictive coding in UK antitrust matters in at least two respects:

First, Pyrrho has been hailed as a victory for proportionality: The use of predictive coding may be appropriate in circumstances where it is effective in ensuring that disclosure exercises remain proportionate. This is particularly important for UK antitrust matters, where the CMA and the other UK sector regulators are under a duty to make sure that each request for information is justified and proportionate and enables companies to balance their duty to cooperate with the exercise of their rights of defense. The principle that document disclosure requests must be proportionate has recently been reaffirmed in Cases C-247-268/14 P, Italmobiliare and Others—in which the Court of Justice of the EU found that the European Commission's requests for information directed at cement manufacturers in an EU antitrust probe were "extremely numerous" and excessive, and thus annulled such requests. Although these cases relate to European Commission investigations under EU competition law, they are also relevant in principle for the application of UK antitrust rules.

Second, in Pyrrho, the court was satisfied that training and review were done appropriately without the need for the disclosure of the "seed" set of documents to the other side. This is particularly relevant for antitrust investigations. Like U.S. Second Requests, it is for the parties that are subject to a disclosure request from the CMA or other UK sector regulators to certify compliance with such a disclosure request. Therefore, there is a strong argument that parties should remain free to use other reasonable means for vetting the accuracy of their disclosure, such as the statistical estimation of recall at the conclusion of the review based on the quality control review of samples from the documents categorized as nonresponsive and nonprivileged.

Toward Convergence between the U.S. and the UK

Lawyers in the UK are likely to rely on the Pyrrho judgment in the future in support of the use of predictive coding in response to large document disclosure requests, including in antitrust investigations.

Given that e-discovery is one area where the U.S. is leading the way (as recognized by Pyrrho itself), some guidance could usefully be drawn from DOJ and FTC experience, with a view to achieving a consistent approach to the use of predictive coding in U.S. and UK antitrust matters. Accordingly, in deciding whether it may be appropriate to propose the use of predictive coding in an antitrust investigation, the parties and their lawyers should take into account the following considerations:

Volume, Timing, and Collection Logistics. Consider whether predictive coding is the most efficient solution after evaluation of the document volume and collection logistics. Predictive coding may not save time and money if the volume of documents is low or if documents have to be collected and processed in small, incremental batches.

Experience. Consider whether the investigating agency staff has experience with predictive coding. The CMA and some of the UK sector regulators are increasingly using predictive coding for the prioritization and review of documents disclosed to them in response to information requests. A less-experienced agency staff may be less likely to agree to predictive coding or, alternatively, more inclined to challenge or delay accepting a certification of completeness where predictive coding has been used without advance acceptance by agency staff. In the absence of formal guidance or additional precedent in antitrust investigations in the UK, the conditions set out in Pyrrho for accepting predictive coding provide a useful precedent.

Recall Rate. Make sure that you are comfortable with the recall rates. Even if the agency staff has agreed to the use of predictive coding, you will still be required to certify the efficacy of the methodology and substantial compliance with the document disclosure request.

Methodology and Protocol. Consider what aspects of the methodology and protocol will require prior agreement with the agency. A highly transparent protocol could complicate the review and open the door for an expanded and time-consuming inquiry, especially if the agency staff does not have a good understanding of the technology or visibility of what information is contained in the documents at the outset. The goal of any review process is to return a satisfactory volume of responsive documents, and ultimately the burden rests on the party deploying the technology to use it appropriately to reach the desired recall rate—which the agency can validate through nonresponsive samples. Agreement on the recall rate and verification of that rate through the review of nonresponsive samples should instead be sufficient to endorse any review process without the unnecessary distraction of prolonged discussions regarding the specific software and work flows.

The Continued Need for Some Linear Review. Linear reviews of predicted responsive documents that contain potentially privileged communications, as identified by "privilege" search terms, are still common as parties seek to identify 100 percent of privileged communications. But as technology and legal standards advance, parties to an investigation and their lawyers should keep an open mind and be prepared for further change and development in this area.

The content of this article is intended to provide a general guide to the subject matter. Specialist advice should be sought about your specific circumstances.
