Arman Gungor's Blog Litigation Support and Technology

14Jan/100

Text Split Merge 1.0 Beta

Text Split Merge 1.0 Beta

Text Split Merge 1.0 Beta

I have lately been working on a small utility for manipulating text files.  I have come to a point where most of the functionality I have been planning on implementing is there, but not yet fully tested. Feel free to give it a whirl if you would like. Here is briefly what it can do:

  • Inserts Law, Ringtail or custom page markers into single page text files.
  • Combines single-page text files into document-level text files by using a text based list of document breaks (page markers can also be inserted at the same time)
  • Identifies page breaks in document-level text files (by the page break character, a custom anchor, Law or Ringtail style page markers), inserts page markers at page breaks and splits the text file into page-level text files.
  • Accepts a text reference file (similar to an Opticon load file) and merges page-level text files by the document breaks provided in the load file¬† (The text reference file can be automatically validated prior to processing).
  • Supports Unicode.
  • Outputs an OCR list file.
  • Mirrors input folder structure or splits text files into subfolders.

A few words of caution:

  • While splitting a document-level text file into page-level text files, Text Split Merge assumes that there are no gaps in the bates numbering scheme as it assigns bates numbers to each individual page. It also assumes that there is no bates overlap between two document-level text files.
  • Having named your text files as their starting bates numbers is a requirement.
  • When merging page-level text files via a document break list, Text Split Merge first sorts the document breaks as well as the input files alphabetically. File names should be zero-filled properly in order for the text files to be combined in the correct order.

To Do:

  • The application needs to be tested thoroughly in different scenarios.
  • Exception handling needs to be improved.
  • Performance improvements.
  • Detailed help file.

Download Text Split Merge Beta 1.0 [36 KB]
Requires .NET Framework 3.5