The 2017 JFK Records Act releases at NARA: Q & A

After looking at Harold Weisberg’s archive last year, I became interested in the JFK assassination, a subject I had never really looked at before. The recent NARA release of documents under the JFK Records Act, which was supposed to be the final release of all previously withheld documents, was therefore quite interesting to me. Unfortunately, the newspaper reporting on this event was not very good, and I eventually decided to take a look at what had come out myself. The release provided some materials that I had not expected to run across, which I will write about in the not too distant future. For this short note, however, I’m just going to summarize the documents releases in a general Q&A form.

When were the documents released and what more is to come?

As of today (November 14, 2017), there have been four public releases of records under the JFK Records Act in 2017. The releases were on July 24, October 26, November 3, and
November 9, and are described in detail on the National Archives and Record Administration (NARA) website, which provides an authoritative excel file of the records released so far. This post is actually just a summary of the NARA excel sheet, I claim nothing original here, except on one small point. I am certain that there will be a number of releases in the future, which I will try to look at as well. President Trump’s October 25th order gave everyone until April 2018 to finish up releasing documents, but who knows if they can really do it? I am somewhat skeptical.

How many documents were released?

In this post, I’ll refer to files posted at NARA as ‘documents’, regardless of whether they were originally paper documents or otherwise. In fact, 17 of the ‘documents’ are audio files of interviews, or interrogations, of Yuri Nosenko. I do not include these in any of my counts. In addition, 16 of these 17 audio files have matching pdf files; these are just single sheets, providing recording dates, tape numbers, and in some cases information about the interviewers (Pete, Nick, Tom, FBI, etc.) If these count as separate documents (since they are separate files), that would add another 16 to the total count. For my purposes, however, I’ll omit them, meaning that the total number of JFK Records Act documents released this year is 20,552. The number of documents released for each date is as follows:

Date No. Docs
7/24 3794
10/26 2891
11/3 654
11/9 13213

These figures are not the same as the official NARA figures. This is because the NARA says that 676 files were released on 11/3; in fact, 22 of these files had already been released on 7/24, so that only 654 new files were released on 11/3. If we were counting the 16 single ‘cover-sheets’ for the audio files, the total would be 20,568 discrete pdf files now available from NARA.

(There is also a minor error on the NARA spreadsheet: RIF# 104-10086-10154 appears twice, but in fact the second of these two documents should be 104-10086-10156. NARA has posted this file under the correct filename, so it is available.)

How many pages were released?

Putting aside the audio files and their cover sheets, the NARA files are all pdfs ranging from 1 to 562 pages. Many of the files have an RIF coversheet, which is supplied by NARA and gives basic JFK Record information. These should not be counted as part of the documents, but since some files have them and some don’t, it is hard to say how many RIF coversheets there are without going through all 20500+ files. Disregarding this problem, here is a table of number of pages per file:

Page range No. of Docs
> 200 115
100 – 199 161
50 – 99 207
30 – 49 204
20 – 29 206
10 – 19 610
6 – 9 1065
5 654
4 1206
3 2707
2 5861
1 7538

Based on this table, the total number of pages released so far this year is 133,761.

This is a surprisingly high number. It is inflated by the fact that the releases include a large number of documents already released in part in previous years. As an example, RIF # 104-10301-10004 is volume 3 of the CIA’s internal history of the Bay of Pigs operation. The NARA document has 408 pages, but in fact this volume has been previously released multiple times (most recently in 2011). There were about a dozen pages with redactions in the 2011 release, and the current NARA release has removed perhaps 4 or 5 of these (redactions remain even in this release). Thus, as many as 390-400 pages of this document have no new information, and even the redactions removed were usually just one or two words. If one ignores this type of ‘inflation’, it is easy to erroneously conclude that much more new material was released than is actually the case.

Another way to look at this question is to compare the number of pages from documents “previously withheld in full” as opposed to the number of pages from documents “previously withheld in part”:

Previous document status Number of docs Number of pages
withheld in part 19510 73427
withheld in full 1039 60271

These numbers are more in line with what we might have expected. The withheld in part documents had withheld a relatively small number of pages; on the other hand, the withheld in full documents, though relatively few, have so far turned out to be more voluminous than some had guessed. Among these newly released documents are the CIA operative files of a number of well-known people, some of these hundreds of pages long. Although often quite dull, these can sometimes give a detailed picture of the careers of important CIA officers such as David Phillips and Scotty Miler, and of CIA asset June Cobb, who many people were very curious about. Whether these people have very much to do with the JFK assassination is of course a different question. There are also lengthy transcripts of telephone taps, and compilations of surveillance photos that, in my opinion, contribute very little to our understanding, except that we can now see that the HSCA was right when they said ‘Nope, nothing there.’

What dates are covered by the released documents?

Most of the documents in this year’s releases are dated. Exceptions are the CIA op files mentioned above. Since some of these files cover decades, it is not reasonable to assign them to any particular date. There are also a fair number of documents where the NARA spreadsheet does not provide a date, even though a date is clearly indicated in the file, sometimes even on the RIF coversheet included at the beginning of the file. No doubt these omissions were due to the massive volume of files releases. In the end, though, there are only 1352 files with no year, so most of the files are dated.

Since dates may be a convenient way of indicating content (and interest) of the releases, these are worth examining more closely. The top year for documents, of course, was 1963; almost 4000 documents of the releases so far are from this one year. In terms of decades, the 1960s are naturally the main document era, with the 1970s a distant second, mostly releases of HSCA documents. There are a mere handful of releases from the 1980s. This is because it was the only decade in which there was no major investigation of the assassination. Several hundred documents were also released in the 1990s, almost all by the ARRB. Here is all this descriptive stuff in a table:

Decade Number of documents
no year 1352
1940s 137
1950s 1485
1960s 13796
1970s 3166
1980s 41
1990s 575

(note that this table follows common practice in counting ‘ought’ years such as 1960 as belonging with the 1960s)

Earliest and Latest files

The most recent documents released so far are dated 1999; these are primarily from Barry Harrelson at CIA, who was coordinating with NARA on release of HSCA CIA segregated materials after the ARRB closure in 1998. A more mysterious item is a letter from Scott Breckinridge to Charles Briggs. Don’t know why this letter is included, since it dates from after the ARRB closed. The earliest dated document is an FBI report on Paul Raigorodsky, from 1941. Raigorodsky was “a member of the Russian-speaking community in Dallas” and testified before the Warren Commission on 3/31/64. There are plenty of files released so far which include earlier documents than this, but these files are all collections of various documents, such as the CIA op files, and most of these were assembled in the 1960s or 70s.

Agencies releasing documents

The NARA spreadsheet also indicates the agencies which produced, or had charge of, the documents being released. This again gives one a fairly clear idea of the type of material in the documents. The table below gives counts for all agencies credited with more than 10 documents.

Agency No. of Documents
CIA 15762
FBI 3760
HSCA 294
NSA 244
SSCIA 296
Blank 110
JCS 43
NARA 29
INSCOM 34
ROCKCOM 16
LBJ 11

(SSCIA = Senate Select Committee on Intelligence Activities, ‘Church Committee’; JCS = Joint Chiefs of Staff, INSCOM = Army Intelligence and Security Command; ROCKCOM = Rockefeller Commission; LBJ = Lyndon B. Johnson Library)

Miscellaneous information

The NARA spreadsheet provides several other types of information, including the latest review dates of the documents, and comments on individual files. All this is useful for those who want to know what is going on ‘behind the scenes’ at NARA, but since that is not my main interest, I’ll omit it for now.

Final note

So far, I have barely scratched the surface of the files released, and since my overall knowledge of the JFK assassination’s many controversies is very limited, I will refrain from offering anything more than the most general comments on what’s out so far. As far as I can tell, much of this material is only distantly related to the assassination; so much so that in many cases it takes great familiarity with points of controversy to see the relevance. This is perhaps not surprising; those who had read this material prior to its release, people such as John Tunheim, the former chairman of the ARRB, have generally warned that there are few surprises to come. But for those who have more general or tangential interests, there is doubtless much of interest. An example is the large number of materials related to the Golitsyn – Nosenko controversy. For me, there were other surprises as well. In the not too distant future, I will try to write up at least one such item that “knocked my eyeglasses off.”

Posted in History, JFK ARC | Comments Off on The 2017 JFK Records Act releases at NARA: Q & A

Zotero to Endnote: part III

This is the latest update of a note I first posted four years ago. Below is a summary of what I’m trying to do, then the revised solution.

I use both Zotero and Endnote to do bibliographies. References that I have as pdf files I keep in a folder on D: drive. I have set Zotero to use a custom directory for these: d:/bibfiles.

It is often convenient to have my Zotero data in an Endnote library. Zotero can export data to Endnote in RIS format, and I want to have the locations of the pdfs included in the Zotero export file. The problem has been that, out of the box, if you tell Zotero that the export file should include pdf locations, Zotero will give you an ris file with ‘internal locations’, and these cannot be directly imported into Endnote. In addition, Zotero will also copy ALL the pdfs into a subdirectory below the export file. I have close to 40g of pdfs in my directory, so this is NOT what I want. I just want the pdf file locations in the export file, not copies of the pdf files.

Here is the recipe to fix this problem. You can change Zotero’s default behavior by editing the ris.js file under the zotero/translators directory. Search for the phrase ‘local file’. Under this phrase there are two lines:

value.push(att[j].defaultPath);
att[j].saveFile(att[j].defaultPath);

Edit the first line to read
value.push(att[j].localPath);

Then comment out the second one:
//att[j].saveFile(att[j].defaultPath);

The first edit will produce an ris export file with the attachment filenames using the correct directory name, i.e. d:/bibfiles/ref1.pdf etc. The second edit will keep Zotero from trying to copy all the pdfs in d:/bibfiles in a subdirectory under your ris export file.

An important caution! The ris.js file is sometimes automatically updated when a new version of Zotero is installed. This will wipe out any changes you do, and cause Zotero to revert to its default behavior. You should check whether the ris.js still has your edits in it before you use it. If you want the default behavior back, you will have to revert the changes yourself. Adding an extra option in the Zotero menus for this type of export may be possible, but it is far beyond my abilities at this point.

Posted in Research methods, Software | Comments Off on Zotero to Endnote: part III

Welcome to Chi Nan, Fall 2017

Hi! The new semester is now into its second week and I’m a little bit behind on most of my stuff as usual. To the new students, Welcome! To the sophomores, juniors, and seniors, hope you had a good summer and have got the classes you want/need this semester. Looking forward to meeting my advisees, give me an email or leave a note here if you’re stopping by!.

Posted in School | Comments Off on Welcome to Chi Nan, Fall 2017

Capturing system command output in perl

Running system commands from inside a perl script is one of the nastiest of all perl headaches. I suffered grievously when trying to use this method with some pdf file addons, and have scars to show for it. Lately, having failed to learn my lesson not to mess with this stuff, I have been working on something similar: trying to use the new version of 7-zip to get crc hashes for the excel files in a directory.

The idea is to use the hash checksums to see which files have been modified, compared to an earlier archived version of the files. 7-zip command line syntax is already tricky and I have no idea what I’m doing in most of this; I’m just following incantations found on various occult internet sites. Here is the command line version of the 7-zip command:

"c:\Program Files\7-zip\7za.exe" h -ir!d:\foo\*.xls -ir!d:\foo\*.xlsx > temp.txt

This gets the crc hashes for all the xls and xlsx files in the d drive directory foo. Doing this in a perl script is equally tricky; the key is putting the various segments into an array, as follows:

my @cmds = (
'c:\Program Files\7-Zip\7za.exe', 'h',
'-ir!d:\foo\*.xls',
'-ir!d:\foo\*.xlsx'
);

One then uses system to invoke the mystic phrases, and an ‘or’ clause to ask for an explanation if it fails:

system (@cmds) == 0
or die "system @cmds failed: $?\n";

The procedure described above works, but the mind boggling issue is how to get the output of this command (lengthy scrolling for about 20 seconds) into a perl variable where you can do something with it. This task completely defeated me after half a day. I then switched to trying to pipe the output into a file. This accounted for the second half of my day. In fact, there is a ready made solution to this problem: Capture::Tiny, written by David Golden and included in the ActivePerl distribution.

The final result is:

use Capture::Tiny ':all';
my ($stdout, $stderr, $exit) = capture {
system( @cmds );
};

The output of the system call is in $stdout, the error info is in $stderr, and the exit return code is in $exit.

All hail David Golden! Hail!!

Posted in Programming | Comments Off on Capturing system command output in perl

Welcome to Spring Semester!

Hello! Fall 2017 was a very quiet semester in the warren, hopefully there will be more coming up this spring. If you want to get in touch and you’re an NCNU student, NCNU’s Moodle site is probably the best way. If you’re not from NCNU and you’re interested in a post here, please try my regular email address, which is available from the DFLL webpage at National Chi Nan University.

Posted in School | Comments Off on Welcome to Spring Semester!

Welcome to the new semester at Chi Nan!

Another semester begins, and I hope all the students in the Department of Foreign Languages and Literature had a pleasant first day. I’m looking forward to the semester, and to seeing all of you.

Posted in School | Comments Off on Welcome to the new semester at Chi Nan!

“I have always had adequate sex that no one appreciated”: The best of Studies in Intelligence

Taking a 10 minute pause from grading final exams, I would like to announce that the Warren will begin a new series introducing government funded fun and enlightenment from Studies in Intelligence, the CIA’s in-house journal. While there is much food for thought in the more analytical articles, there is also plenty of idle-reading entertainment hidden in the shadowy depths of this mysterious but official publication. Fearlessly footpadding through its classified depths, I have located the best of these cloaked gems and will expose them to the world at large at irregular intervals. Today’s recommendation is, in my opinion, one of the most incisive pieces SI ever published. What are you waiting for? Click here and read on.

Posted in History, Intelligence | Comments Off on “I have always had adequate sex that no one appreciated”: The best of Studies in Intelligence

Chiang Kai-shek Meets Tu Yueh-sheng

This note picks up from my last post on the sources and claims of Sterling Seagrave’s 1985 book, The Soong Dynasty (SD).

SD enjoyed a fourteen-week run on the New York Times best-seller list and launched Seagrave’s career as a chronicler of Asian history. His books since then have  generally met with success, and in the case of SD, have even had some influence on more scholarly works. This is unfortunate, because SD is one of the least reliable books on history I have ever read.

Tu Yueh-sheng, Chiang Kai-shek, and the Soongs

One of the main themes of Seagrave’s book is the role of Tu Yueh-sheng, leader of Shanghai’s Green Gang, in the lives and fortunes of the famous Soong family, and in the career of Chiang Kai-shek, perhaps the most prominent politician and military leader of pre-Communist China.

According to Seagrave, Chiang Kai-shek’s relationship with Tu Yueh-sheng can be traced back to the the beginning of China’s Republican era (1911). Through this relationship, Chiang, fronting for the Green Gang, helped Tu over the Nationalist Party (the KMT), the ruling party of China from 1927 to 1949.  As Seagrave says:

Chiang’s direct connection with the notorious Shanghai Green Gang after the winter of 1926-27 has been known for many years, but there has been only a vague understanding that those links went back much earlier, and of how they affected his career. It is now possible for the first time to see the “Divine Skein” linking them all the way back to his youth, before 1910, and the manner in which the Green Gang leaders used Chiang decisively (and were used by him) to snatch the revolution from the hands of Dr. Sun Yat-sen’s coalition. (12)

In fact, Seagrave’s evidence for an earlier alliance between Chiang and Tu is non-existent, based on a single spurious citation.

A non-existent reference

Seagrave’s most specific claim for an early acquaintance between Chiang and Tu is given on page 152 of the Soong Dynasty:

Big-eared Tu enjoyed visiting the famous Blue Villa and cruising the other Green Gang brothels in the Blue Chamber District with a young, ill-tempered bravo by the name of Chiang Kai-shek.

My last post on Seagrave’s book attempted to find a source, any source, for the existence of a Shanghai brothel called the Blue Villa, staffed, according to Seagrave, by 121 prostitutes (SD 158).  As it turned, there was indeed a book which mentioned such a place: a 1977 novel by Alain Robbe-Grillet called Topology of a Phantom City.

The fictional provenance of this detail cannot inspire confidence in the reliability of Seagrave’s claims, but for the general fact of an early acquaintance between Chiang and Tu, Seagrave’s endnote gives what seems to be a non-fictional source: “The intimate brothel relationship between Tu and Chiang at this early stage is noted by Murphey, pp. 7-9.”

This is a reference to Rhoads Murphey’s 1953 book Shanghai: Key to Modern China.  Unfortunately, this reference is as fictional as the Blue Villa itself. Pages 7-9 of Murphey do not mention Tu or Chiang.  In fact Tu Yueh-sheng is not mentioned once in Murphey’s book. Chiang is mentioned, once, but it is a quote on Shanghai’s status in the 1943, with no relevance to the brothels of early 20th century China (Murphey, 25).

The spurious nature of Seagrave’s citation was first noted by C. Martin Wilbur, an important scholar of China’s Republican era, in a review of Seagrave’s book, which Wilbur titled “Fabricating History.” 1

Wilbur’s review examines only the first 200 pages of Seagrave’s book, compiling a long list of errors, exaggerations, “flourishes” and “embellishments” and several claims that Wilbur bluntly calls fabrications.

Discovering that Murphey’s book had nothing on Tu and Chiang at the Blue Villa, Wilbur went through some of Seagrave’s other materials and decided that this claim may have been based on a passage in Brian Crozier’s biography of Chiang, The Man Who Lost China.  This passage does indeed describe Chiang paying regular visits to the brothel district of Shanghai.  There is a major discrepancy with Seagrave’s claim, however; according to Crozier, Chiang’s companion in his brothel excursions is not Tu Yueh-sheng, but Ch’en Ch’i-mei (陳其美).

Ch’en was an important figure in the years immediately before and after the revolution of 1911.  He was also an important figure in Chiang Kai-shek’s life, supposedly responsible for introducing Chiang to Sun Yat-sen. The passage Wilbur found in Crozier reads:

“Chiang’s friend Ch’en Ch’i-mei was his mentor in other things besides revolution. Whenever he frequented the houses of prostitution, Chiang was with him” (44).

In fact, a second passage from Crozier is even closer to Seagrave:

His revolutionary mentor, General Ch’en Ch’i-mei, had introduced him to the “blue chamber” district, with its brothels for all purses. (58)

Here we encounter the “Blue Chamber district” Seagrave mentions. These are the only references to Chiang visiting brothels with a named person that I have found in any of the books Seagrave lists in his bibliography.

But why cite Murphey for Crozier? It is of course possible for Seagrave to mistakenly reference book A for information derived from book B. In fact, it is frequent; there are dozens of examples of this in SD.  But for Seagrave, based on Crozier’s book, to confound Ch’en Ch’i-mei with Tu Yueh-sheng is not possible.  So as Wilbur asks, “Why the switch of names? Could it be because Ch’en died in 1916, and so cannot fit the conspiracy theory?” (132)

This spurious reference is the only one in Seagrave’s entire book claiming to document a tie between Chiang and Tu predating 1927.  Perhaps it is appropriate, then, that Seagrave places their carousing in Robbe-Grillet’s Blue Villa, fictional icing on top of historical falsehood, and a strong justification for the title of Wilbur’s review.

Tu and Chiang’s First Meeting: Huang Chin-jung speaks

So when did Chiang and Tu first meet?  One possible answer is in Brian Martin’s 1996 book, The Shanghai Green Gang: Politics and Organized Crime, 1919-1937.  Martin cites no less a source than Huang Chin-jung (Huang Jinrong), known to Seagrave’s readers as Pock-marked Huang. The meeting took place following the defeat of the warlord forces in Shanghai in 1927. Martin claims that:

According to Huang Jinrong’s own account, as told to a senior Chinese detective in the French Concession police in 1939, he and Du Yuesheng [Tu Yueh-sheng] personally met Jiang’s [Chiang’s] airplane at Longhua Airfield [in Shanghai] on March 26, where he introduced Jiang to Du. (99)

Despite Martin’s enthusiasm, the source is not Huang himself, but a reminiscent account from the detective, Xue Gengxin (薛畊莘), which was published in a 1980s compilation.2 While it still might not be the petrified truth, at least it really appears in a book.

  1. C. Martin Wilbur, “Fabricating History,” Issues and Studies 22.5 (1986): 129–148.
  2. Jiu Shanghai de banghui 舊上海的幫會, 1986, pp. 87-107
Posted in History, Republican China | 2 Comments

Updating WordPress

I finally got around to updating this site’s wordpress installation. The reason for the long delay was that the online updating process from wordpress was broken.  I had originally thought this was because I had used non-standard permissions in the wordpress directory (using ‘hardened’ permissions), but it turns out that this was not the case.

Instead, for some reason, during my last update to wordpress almost a year and a half ago, the ownership of some of the files was not assigned to the Apache webserver. The permissions were okay, but the owner was wrong. This prevents the online update from working. These mis-owned files were actually shown during the latest version of the update process. Is this a new display for the update process, or have I been missing this for the last 18 months??

In any case, the explanation of the failed update was still not very clear in describing the problem, nor was this page.  It took a fair amount of head-scratching and experimenting to finally realize that the answer is simply to chown -R the wordpress directory using the web server name, and the update will work.  Note that if you have any symlinks in your directory, the source file also needs to be chowned, not just the symlink. (If I have misstated this, please let me know!)

Having been put on full maintenance alert, I also took a look at my log files, which showed that my xmlrpc file is being regularly hammered by ‘guests’ who claim to be from the Ukraine. Who knows where they’re really from. This is a well-known problem, so I’ve simply blocked that functionality for now.

In addition, some bot claiming to be from Majestic 12 has been ringing my bell non-stop, so I’ve put up a robots.txt file as well.  Of course that won’t do any good if it’s just some hacker with a Majestic 12 text label on his bot. We’ll see.

Posted in Software | Comments Off on Updating WordPress

Two sources for the Weisberg Collection

There are currently two places to get materials from the Weisberg collection. The main source is naturally the website that Hood College set up for the collection.  The second source is Archive.org. The Archive.org now has a “copy” of the Hood collection, and when searching for Weisberg materials on-line, both of these often turn up in search results.

Curious about this, I finally got around to downloading the Weisberg materials on Archive.org last week and have now had a chance to take a look at them.  They turn out to be significantly different from the materials on the Hood College website, so I’m posting a note on some of these differences.

Archive.org is a gigantic filing cabinet, and it can sometimes be quite difficult to track down the sources of the materials that are put up there.  In this case, it seems the Weisberg materials there were posted mostly by one Mike Best, archivist for the National Security Internet Archive. I haven’t quite figured out who or what NSIA is, except that it is not related to the National Security Archive at George Washington University.  The NSIA was registered at Archive.org in March 2015, and since then it has since posted a huge amount of materials.  NSIA began posting Weisberg Materials in August 2015, and apparently finished putting up what they had by the end of September.

The description of the Archive.org version is at “Complete Weisberg Archive on the JFK Assassination”, which says: “Harold Weisberg donated the world’s largest accessible private collection of government documents and public records relating to the assassination of President John F. Kennedy to Hood College and the Beneficial-Hodson Library at Hood College, which donated a copy to the National Security Internet Archive.”

So this is not just someone scraping the Hood collection, but a copy provided by Hood to NSIA. If you really want the whole thing, there it is: 29 compressed files, over 100 gigabytes even in the ultra-compressed 7z format.  It was quite a job getting all this stuff direct from archive.org. There is a torrent file that might be faster, but the word is that our school throttles torrents, so I did multi-day downloading through archive.org.

Having gotten the whole thing, I’ve had a chance to compare parts of it with the Hood College version, and they are indeed different.  The most important difference is that the Hood pdf files were run through OCR software (apparently mostly Omnipage 18) to convert them into searchable files.  There is a search interface for the OCR versions available at the Hood website, and this is by far the most convenient, effective way of accessing the Weisberg collection.  The Archive.org files have not been OCRed; they are simply images.

This is not the end of the story though.  After some rather hard poking through the NSIA materials, it seems that this is very likely a working copy of the Hood materials.  It’s most useful feature is that it includes excel files for the pdfs in each directory. These excel files have all kinds of important information, such as dates, to-from fields for letters etc, and comments and cross-references to related documents.  These excel files are mostly not available from the Hood website.

Unfortunately, the fact that these are “working” files also has another meaning.  The whole thing seems to have been simply yanked off a hard disk at some point. The most recent files in the NSIA materials are dated 2015-07-12, and there are a number of temporary excel files included in the archive which also have this date. So the backup was done without even closing the excel files that were being edited.  A number of these were clearly not yet done, with numerous inconsistencies in the files listed in the excel sheets and the files actually present in the directory. Some of the excel sheets are even in the wrong directories, with whole directories sometimes misplaced inside other directories as well.

This is not to dismiss the amazing amount of work done on the collection. The large majority of the files are listed, and the large majority of the information listed is accurate, but the Weisberg collection is so huge that “large majority” means there are still thousands of places where there are problems. It is not a trivial task to fix these problems.

It is also worth noting that there are tens of thousands of duplicate pdfs throughout the collection. These are not just duplicate files in the Weisberg collection; there are places where the exact same pdf file is present in multiple locations. Some of this is probably some sort of cross referencing system. An example is that in the giant C zip file, there are dozens of directories of the form CIA [someone’s name].  Most of these appear in other places, with the directory name in the form [someone’s name] CIA. In the second form, however, sometimes the pdf files in these directories are still named CIA [someone’s name]. In cases where they have been renamed, it is almost always the case that they are still the same pdfs, just with the names changed. Some of these duplicate directories also do not appear in the materials on the Hood website and it seems that NSIA copy may represent the Hood archivists’ current efforts in this area.

Despite these problems, the NSIA copy is a useful ancillary to anyone who wants to work with the collection as a whole As an example, the excel indices include a “date” field for much of the collection’s files. According to this, the earliest fully dated document in the collection is Weisberg’s birth certificate: April 8, 1913.  There are also a few documents from after Weisberg’s death in 2002, including the obituary of Weisberg’s wife Lillian, who died March 20, 2003.  The most recent document is a powerpoint file for a 2011 conference presentation by Clayton Ogilvie, the primary archivist for the Hood collection (Presentation-Canterbury 01.pptx, located in the P zip file, apparently not otherwise available either at Archive.org or Hood).  This gives a very useful overview of the collection and its history.  Everyone interested in Weisberg and his materials owes a huge thanks to Mr. Ogilvie and the others who have put so much time into this project.

Posted in History, Research methods | Comments Off on Two sources for the Weisberg Collection