Valora Technologies Inc.: May 2012

Friday, May 4, 2012

3 Drawbacks To Predictive Coding

Valora’s Response to LTN article: Take Two: Reactions to 'Da Silva Moore' Predictive Coding Order

What is missing there, and elsewhere, is a discussion of the specific weaknesses of the overall Predictive Coding technique. Here are just three drawbacks of the technique:

1. PC tagging algorithms are not transparent. No one really knows why the PC engine “chose” the documents it did. Typically, the “choosing” algorithm is hidden and not disclosed. All we know is that the document it selected is somehow a lot like another document that was already tagged.
2. PC has no checks or balances on the skill set, education, consistency or motivations of the seed set coder(s). The entire Predictive Coding approach assumes that the seed set coder(s) know what they are doing, and that they are correct, consistent and honest. Would you defend that position, particularly given that the “human being as gold standard” concept has been roundly deflated (see Blair & Maron, Grossman, TREC, etc.)?
3. Typically, seed set creation and audit sampling for PC use a random sampling technique, the weakest of all types.

Other sampling techniques (stratified, cluster, panel, etc.) are aware of document attributes and utilize intelligent groupings to create a much stronger, more representative sample for seed set coding and auditing purposes.
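To make the contrast concrete, here is a minimal sketch of simple random sampling next to proportional stratified sampling. It is an illustration only, not any vendor's method: the corpus, the "type" attribute and the function names are all hypothetical, and a real matter would stratify on whatever document attributes are actually available.

```python
import random
from collections import defaultdict


def simple_random_sample(docs, n, seed=0):
    """Draw n documents uniformly at random, ignoring all attributes."""
    rng = random.Random(seed)
    return rng.sample(docs, n)


def stratified_sample(docs, n, key, seed=0):
    """Draw roughly n documents, allocated to each stratum in proportion
    to its size, with at least one document taken from every stratum.

    Because of rounding and the one-per-stratum floor, the result can
    differ slightly from n.
    """
    rng = random.Random(seed)

    # Group documents by the chosen attribute (the "stratum").
    strata = defaultdict(list)
    for doc in docs:
        strata[key(doc)].append(doc)

    total = len(docs)
    sample = []
    for label in sorted(strata):
        members = strata[label]
        k = max(1, round(n * len(members) / total))
        k = min(k, len(members))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical corpus: mostly email, with a small but important group of contracts.
corpus = (
    [{"id": i, "type": "email"} for i in range(900)]
    + [{"id": 900 + i, "type": "spreadsheet"} for i in range(80)]
    + [{"id": 980 + i, "type": "contract"} for i in range(20)]
)

random_seed_set = simple_random_sample(corpus, 50)
stratified_seed_set = stratified_sample(corpus, 50, key=lambda d: d["type"])

random_types = {d["type"] for d in random_seed_set}
stratified_types = {d["type"] for d in stratified_seed_set}
```

With a simple random draw, a rare document type may or may not appear in the seed set, depending on luck. The stratified draw guarantees that every type is represented, which is the sense in which attribute-aware sampling yields a more representative sample.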

At present, all Predictive Coding solutions are products, which means they offer limited functionality and flexibility for specific case matters. Perhaps we should instead be thinking about the broader picture of Technology-Assisted Review (TAR) as a service: customizable, measurable and transparent.

About the Author

Sandra E. Serkes, President & CEO of Valora Technologies Inc.
Ms. Serkes is a dynamic leader with an extensive background spanning over 20 years in software marketing, product management and corporate strategy, particularly in document processing, computer telephony and speech recognition. One of Valora's original founders, Ms. Serkes has been actively involved in Valora since its inception in 2000. Today, Ms. Serkes oversees Sales & Marketing, Finance & Administration, Operations, Engineering and Corporate Strategy.

A graduate of both Harvard Business School and MIT, Ms. Serkes is a frequent industry speaker and panelist. She is an active participant in the Women Presidents' Org., The Commonwealth Institute, the MIT Enterprise Forum, the Massachusetts Software Council and the Network of Harvard Alumnae. Ms. Serkes serves on the boards of several technology and service start-ups. Ms. Serkes was named a 2006 "Woman to Watch" by Women's Business Magazine.

About the Guest Authors

Aaron Goodisman is a software industry veteran with over 20 years of experience in engineering management, software architecture, and product development. Prior to founding Valora Technologies, Mr. Goodisman served as Vice President of Engineering at SilverStream Software, acting as both manager and visionary for this award-winning application server product and its associated development and deployment tools.

Mr. Goodisman received his undergraduate and Master's degrees in Computer Science from MIT and is considered a world expert in Java industry standards and UI design. He is a frequent industry speaker and has authored several articles for industry publications. Mr. Goodisman is named as the inventor on several U.S. patents and currently pending patent applications.

Aaron's Blog Articles:
Electronic Files Rehashed
