Friday, September 6, 2013

Need for speed in today's eDiscovery

Vendor for your eDiscovery Needs: Alaya Discovery

The amount of data that needs processed today for eDiscovery is astounding. From just a few years ago where 10 Gb was considered big data, and where the processing vendor could take up to a week to process, today, a terabyte of data can be processed within a 24 hour period.

While dealing with big ESI, the best way to handle is to go the option of ECA (Early Case Assessment). This will involve removing any redundant data that is not relevant to your case. This can be achieved by a number of methods.


1. Deduplication (either on a custodian level or global level)
2. Date range filtering (exclude items that are not within your date range)
3. Term filtering (you search for specific terms or keywords, and process those documents containing the terms)
4. NIST filter (removal of system files by comparing against standard HASH values of the files in a database)
5. E-Mail thread identification (identify emails by conversation and quickly determine which are responsive, and disregard the non-responsive conversations)
6. Near Duplication (group together similar documents based on text content so that reviewers can focus on the pivot documents and suppressing the duplicates to save time)

The next step is to review responsive documents, tag them, tiff them for redactions, etc, and put to final production.

Next item: Types of Processing.

No comments:

Post a Comment