Originally posted by ArMaP
reply to post by idealord
I hope you don't mind, but I converted your raw data to a real (xlsx format) Excel 2007 spreadsheet, with 124,741 records.
It's available here (I hope it works).
Hi ArMaP, Jeff, everyone,
I've now got a new computer with Excel 2007 and have downloaded ArMaP's version of Jeff's spreadsheet.
This has allowed me to access the full number of rows (i.e. over 124,000).
While I plan on getting really stuck into the spreadsheet/index once Xtraeme's downloading tool is up and running, I've had a quick play with the spreadsheet already.
There seems to be a problem with duplication in ArMaP's version (which may also be in Jeff's original version of the spreadsheet - I don't know).
The first column in the spreadsheet shows the image number of the record, so there should (presumably) be only one row per image number.
However, when I sort by image number there are commonly two, three or four lines with the same image number.
Using the technique mentioned above (drawn to my attention by other helpful members of ATS, in the computer help forum) I've clicked on the "Data" tab in Excel 2007, clicked on "Remove Duplicates" and selected the first column. This is a fairly quick and easy way to remove duplicates of the rows for each image number.
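For anyone who would rather do the same thing outside Excel, here is a minimal Python sketch of the idea. It keeps the first row seen for each image number, which is also what Excel's "Remove Duplicates" does. The function name and the sample rows are made up for illustration and are not taken from the real spreadsheet.

```python
def remove_duplicates(rows, key_index=0):
    """Keep only the first row seen for each value in the key column."""
    seen = set()
    kept = []
    for row in rows:
        key = row[key_index]
        if key not in seen:
            seen.add(key)
            kept.append(row)
    return kept


# Hypothetical sample: (image number, location). Not real data.
sample_rows = [
    ("1001", "Kent"),
    ("1001", "Kent"),
    ("1002", "Leeds"),
    ("1003", "Wales"),
    ("1003", "Wales"),
    ("1003", "Wales"),
]

deduped = remove_duplicates(sample_rows)
print(len(sample_rows), "rows before,", len(deduped), "rows after")
```

With the real data, the rows could come from a CSV export of the spreadsheet read with Python's built-in csv module, assuming the image number is still the first column.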
Unfortunately, this reduces the number of rows to 48,503...
So (unless I'm missing something, which is a distinct possibility), it seems that the spreadsheet has rows for fewer than half of the image numbers (48,503 is roughly 39% of the 124,741 records).
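A rough way to double-check this is to count how many image numbers appear once, twice, three times and so on, and to work out what share of the rows is left after de-duplication. The sketch below assumes the image number is in the first column. The sample rows are invented, and only the two totals (124,741 and 48,503) come from this thread.

```python
from collections import Counter


def duplicate_profile(rows, key_index=0):
    """Map 'times an image number appears' to 'how many image numbers do so'."""
    per_image = Counter(row[key_index] for row in rows)
    return dict(Counter(per_image.values()))


# Hypothetical sample: (image number, location). Not real data.
sample_rows = [
    ("1001", "Kent"),
    ("1001", "Kent"),
    ("1002", "Leeds"),
    ("1003", "Wales"),
    ("1003", "Wales"),
    ("1003", "Wales"),
]

profile = duplicate_profile(sample_rows)
print(profile)

# Totals quoted in this thread.
total_rows = 124_741
distinct_images = 48_503
share = distinct_images / total_rows
print(f"{share:.1%} of the rows remain after removing duplicates")
```

If most image numbers turn up two to four times, that would fit the drop from 124,741 rows to 48,503.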
While the current content allows some work to be done (e.g. searching for records relating to certain locations), some of the other objectives (e.g. sorting by the number of pages on each incident) would require a more complete spreadsheet.
Any thoughts?
All the best,
Isaac