chief foia officers council - archives.gov
TRANSCRIPT
![Page 1: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/1.jpg)
CHIEF FOIA OFFICERS COUNCIL
- TECHNOLOGY COMMITTEE –
ARTIFICIAL INTELLIGENCE
WORKING GROUP
AI IN FOIA OVERVIEW
Nick Wittenberg - Chair
Michelle McKown
Jennifer MacDonald
1
![Page 2: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/2.jpg)
FOIA AI Overview: Backlogs, Boredom, Bodies 2
![Page 3: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/3.jpg)
3
![Page 4: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/4.jpg)
4 FOIA PROGRAMS LEVERAGE AI TO MEET GOALS
![Page 5: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/5.jpg)
ARTIFICIAL INTELLIGENCE: ROBOTIC PROCESS AUTOMATION:
AUTOCORRECT FIND AND REDACT
What is AI? 5
![Page 6: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/6.jpg)
Basic Technological Solutions are Advancing 6
Boolean Filters Custodian Subject
Keyword Search Within a Search
![Page 7: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/7.jpg)
7
• “Nearly all information created today is created in digital form, with dizzying arrays of technologies creating dizzying arrays of data types, at rates and in volumes that continue to grow geometrically, with an estimated 89 Tools are billion business emails sent each day and large
available to organizations now measuring data stores in petabytes. At the same time, advances in digital forensics, information assist- retrieval, and other disciplines have yielded a plethora of tools that make it possible to conduct discovery in even the largest cases in a manner that is defensible, timely, and cost effective.”
![Page 8: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/8.jpg)
8
WHY AI?
DATA SIZE HAS GROWN
3 MEGAPIXEL CAMERA’S VS. XXL MEGAPIX
OUTSTRIPS AN INDIVIDUAL FOIA OFFICER’S ABILITY TO MANAGE DATA WITHOUT ROBUST TECH TOOLS
http://www.sdsdiscovery.com/resources/data-conversions/; http://catalystsecure.com/blog/2011/04/understanding-and-managing-costs-in-e-discovery/; http://catalystsecure.com/blog/2011/04/understanding-and-managing-costs-in-e-discovery/;
![Page 9: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/9.jpg)
9
Artificial Intelligence and FOIA
• Artificial Intelligence (AI) has made litigation and FOIA more efficient and accurate
• Typically in litigation 10 - 15% Responsive* Depends on population size, privileges, exemptions, and other sensitivities.
• Training the Machine: – Example: Train machine on 10,000 documents for
a population of 100,000 documents. Training set used to code majority of population. However, maybe 20,000 documents that don’t get coded because they are foreign language, corrupt files, excel, or other structured data. However, you still saved an incredible amount of time and were way more accurate with the 80,000 coded from the 10,000 document training set.
![Page 10: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/10.jpg)
10
Artificial • Solution = Technology Assisted Review (TAR)
– Predictive Coding – also known as TAR 1.0 Intelligence – Continuous Active Learning – also known as
TAR 2.0 and FOIA – Cluster Visualization
![Page 11: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/11.jpg)
Predictive Coding 11
•
•
•
•
•
Also know as Tar 1.0
Decade old tech
Accepted in Courts in 2012.
Sample of review
Need very senior person to help make d ecisions
![Page 12: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/12.jpg)
Continuous Active Learning 12
•
•
Also known as TAR 2.0
Judge Peck- courts would allow continuous active learning if a party requested it
–
–
–
–
Start at doc 1
Doesn’t take random samples throughout population like Tar 1.0
Based on decisions
Content is separated into piles
![Page 13: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/13.jpg)
m1age ~vertisement l
email / ~rtising
13
Cluster Visualization
![Page 14: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/14.jpg)
De-Duplication
Propagation
Copy from Previous
Batching by Custodian or
Saved Searches
Relativity Integration
Points
Features to Power FOIA Reviews 14
![Page 15: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/15.jpg)
15
WHO ELSE IS USING IT, WHY?
![Page 16: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/16.jpg)
- - - -- -
16
eDiscovery & FOIA
• eDiscovery has been used in the legal realms for over a decade-Judges accept it
• Large Data cases caused a need for electronic discovery review where manually going through bankers boxes, files, and documents could not be done in a timely fashion
– Enron, WorldCom, Arthur Andersen*
• Software is dramatically updated over the years to make process more efficient and accurate
• Using the advancements appreciated in the legal world so too can FOIA appreciate the results
• https://zapproved.com/project/a timeline of electronic discovery/ • https://www.investopedia.com/updates/enron scandal summary/
![Page 17: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/17.jpg)
• Case – culling down to focus review.
– 1,000 in batch, 200 likely responsive, 800 likely not
• Report to show
– Responsiveness
– Privilege
– Exemptions
• •
•
Auto Redaction Software
Exemption 1 and CBI productions can be updated at a later time when matters are determined to be downgraded.
Reviewed and Produced can maintain same coding decisions and redactions
– Future request – receive a pre-case assessment that states how many records are in this collection and how many have been reviewed and produced
Further Technological Search and Previously Produced Solutions
17
![Page 18: CHIEF FOIA OFFICERS COUNCIL - archives.gov](https://reader036.vdocument.in/reader036/viewer/2022062316/62a8b75141116054d316ac34/html5/thumbnails/18.jpg)
18 “I REALLY THINK THE FUTURE OF
FOIA IS TAKING IT TO THE NEXT LEVEL AND USING ARTIFICIAL INTELLIGENCE, AND USING SOFTWARE THAT CAN DO THINGS LIKE GROUP RECORDS TOGETHER, EITHER BY CONCEPT OR RELATIONSHIP, THOSE KINDS OF SOFTWARE.”
MELANIE PUSTAY, DIRECTOR, DEPARTMENT OF JUSTICE’S OFFICE OF INFORMATION POLICY (OIP), MAY 03, 2019