Fri Nov 17 20:10:46 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.14

Sun Nov 12 14:13:36 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Gotta set home and base correctly for Catharsiscomic

Sun Nov 12 14:13:10 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Rebuilt chaptered list to use if_changed.

Sat Nov 11 22:10:12 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Flip to scan url for index numbers first. Seems to work better.  Also stop referencing an undefined var.

Sat Nov 11 22:09:01 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Switch catharsiscomic to custom work, need to get around date format flipping.

Sat Nov 11 14:18:15 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * string not symbol.

Sat Nov 11 14:03:08 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Starting using the new date patterning code.

Sat Nov 11 14:02:01 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Use build_item.

Sat Nov 11 13:56:17 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Rebuilding how scanning dates out of strings all works.

Sat Nov  4 20:02:52 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * commentary

Sat Nov  4 19:42:03 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Cleaned out the last of the comic scrapers that used blaze.

Sat Nov  4 18:43:02 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * new comic Castlevania RPG and SkyFall

Sat Oct 28 18:30:22 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Take advantage of new date finding stuff.

Sat Oct 28 18:29:49 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Cleaning up some odds and ends.

Sat Oct 28 17:58:06 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Put the fancy date string finding into RSSItem.new.
  Aiming for more accurate listing of the links scraped, doing some fancy
  detection of date strings.  Only really changes things when they're found.
  Otherwise should be like before.
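
  A minimal sketch of the kind of date detection described here.  find_date
  is a hypothetical helper and the pattern is an assumption; the real RSSItem
  code is not shown in this log.

```ruby
require 'date'

# Hypothetical helper: look for a YYYY-MM-DD or YYYYMMDD run in a link's
# text or href and return a Date, or nil when nothing date-like is found.
def find_date(str)
  m = str.match(/(\d{4})[-\/.]?(\d{2})[-\/.]?(\d{2})/)
  return nil unless m
  y, mo, d = m.captures.map(&:to_i)
  Date.valid_date?(y, mo, d) ? Date.new(y, mo, d) : nil
end

find_date('/archive/20061028.html')  # a Date for 2006-10-28
find_date('/archive/page42.html')    # nil, so the caller behaves as before
```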

Sat Oct 28 14:09:14 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Rewrote an optionlist scanner the hard way to see what I'm dealing with.

Sat Oct 28 14:08:51 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Stop using the date scanner thingy. It's broken.

Sat Oct 28 13:34:52 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Start using the better date finding stuff.

Sat Oct 28 13:17:10 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * moved frightmoore university to old comics.

Sat Oct 28 13:16:39 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Start laying down some code that does better finding dates for items.

Sat Oct 28 13:15:20 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * start cleaning api docs. move anchorlist to new format.

Sat Oct 28 11:42:38 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Trying out the new api stuff.

Sat Oct 28 11:41:39 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Adding some more new api stuff. Leaving a few cookies on things to do later.

Sat Oct 28 10:04:39 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Move some stuff around.

Sat Oct 28 10:04:08 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * rebuild countyoursheep scraper with new scrax api design.

Sat Oct 28 09:57:28 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * start of api redesign.
  So I basically have been playing with why's stuff too much.  There is a
  level of bendiness that is really nice, and this doesn't have, and I want.
  So I'm slowly trying to bend everything back that way, without breaking
  things.  Will see if I pull it off or not.
  
  The previous design of scrax is very very single use, single fire.  I want
  to get away from that.  Most of this, I think, is a careful generalization
  of the parts.
  
  A good example: The original design of scrax watches a single page.  There
  are plenty of examples however, where you need to follow a link or two
  before you get to the page you actually want.  Doing that in the old design
  required changes internal to scrax.  Not very flexible.  The new design
  lets you grab-if-changed data of pages as needed.
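
  One way to picture grab-if-changed, as a sketch only.  ChangeCache and the
  if_changed signature below are assumptions made for illustration; they are
  not taken from scrax itself.

```ruby
require 'digest'

# Hypothetical helper: remembers a digest per URL and yields the body
# only when it differs from what was seen last time.
class ChangeCache
  def initialize
    @seen = {}
  end

  # Returns the block's value if the body changed, nil otherwise.
  def if_changed(url, body)
    digest = Digest::SHA256.hexdigest(body)
    return nil if @seen[url] == digest
    @seen[url] = digest
    yield body
  end
end

cache = ChangeCache.new
body  = '<a href="1.html">1</a>'
cache.if_changed('http://example.com/archive', body) { |b| b.length }  # block runs
cache.if_changed('http://example.com/archive', body) { |b| b.length }  # nil, unchanged
```

  Because each page is checked on its own, a script can follow a link or two
  and only do work for the pages that actually changed.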

Wed Sep 13 15:37:19 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Add limit to friendly hostility

Wed Sep 13 11:28:30 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Log message before adding rssitem.

Wed Sep 13 11:26:23 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Added comix Friendly Hostility

Thu Aug 17 16:59:25 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Oots changed the way they do things.

Sat Jul 22 13:55:15 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Added atom output, not used or accessible at the moment.

Sat Jul 22 13:21:19 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.13

Sat Jul 22 13:19:54 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Fixed regexp that clobbered entities.

Sat Jul 22 13:10:27 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * reordered things in xml output.

Sat Jul 22 12:57:55 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * guids are never links.

Sat Jul 22 12:44:14 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Make the guid function return something useful even when the date isn't

Sat Jul 22 11:56:42 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Try to make guids in rss unique over time as well as space.

Sat Jul 22 11:56:17 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Update Changes file always.

Sat Jul 22 11:14:44 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Fixed calendar scraper.
  Seems people have started to put complete tables into the <td/> where the
  month and year are placed.  So needed some code to make sure we grab the
  complete <td/> and then only pull out the month and year from it.
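
  Roughly what "only pull out the month and year" can look like once the
  complete <td/> is in hand.  This sketch strips tags with a crude regexp
  rather than the tokenizer, and month_and_year is a made-up name.

```ruby
require 'date'

MONTH_PATTERN = Date::MONTHNAMES.compact.join('|')

# Given the full contents of a calendar header cell (possibly containing
# nested tables), return [month_number, year] or nil.
def month_and_year(td_html)
  text = td_html.gsub(/<[^>]*>/, ' ')   # drop all markup, keep the text
  m = text.match(/\b(#{MONTH_PATTERN})\s+(\d{4})\b/i)
  m && [Date::MONTHNAMES.index(m[1].capitalize), m[2].to_i]
end

month_and_year('<td><table><tr><td>July 2006</td></tr></table></td>')
```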

Sat Jul 22 11:02:50 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Clean up error messages.

Sat Jul 22 09:55:41 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Add random number to errors so they always appear unique.

Mon May 22 11:22:21 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Forgot to require installed scrax versus debug scrax.

Mon May 22 11:15:40 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.12

Mon May 22 11:13:06 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * various touchups 

Mon May 22 11:12:52 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Add new comix Slightly Damned

Mon May 22 11:10:43 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Timestamp logs, no port, report 404
  Seems that adding the port to the Host header is wrong. so stop that.
  Also dump out a message when the link 404s.

Mon May 22 11:10:06 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Document all debug flags.

Mon May 22 11:09:28 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Build file touchups

Sat Apr 29 12:49:25 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * yes, really, no more CGI.

Sat Apr 29 12:46:50 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * builder escapes html for us, so don't need cgi anymore

Sat Apr 29 12:04:32 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.11

Sat Apr 29 12:02:02 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Touch up documentation

Sat Apr 29 11:51:42 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Use PackageTask and GemPackageTask instead of doing it all by hand

Sat Apr 29 11:33:35 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Dominic Deegan has parsable archive again.

Sat Apr 29 11:28:35 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Improved Scrax::RSS classes
  Replaced my hand crufted rss strings with XML Builder calls.  Makes for
  cleaner xml.
  
  Also went and added support for real dates in rss items.  Still support the
  simple integer ids for when you cannot parse out a date for a scraper.
  Typically, if you can figure out a date for an item use that, otherwise
  make up some unique, ordered value.
  
  Tweaked keencalender to provide date info to rssitems, since it always
  knows.  Other provided scrapers still use ids.  I'll see if I can find a
  sane way of guessing if there is an encoded datestamp, but that's later.
  

Sat Apr 29 11:27:56 CDT 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Tabs to Spaces

Wed Mar 22 09:36:36 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Added clean rules and html rule.

Wed Mar 22 09:20:19 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Rake Rules to build html project page from README and darcs changes

Mon Mar 13 21:38:10 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Moved base link for Keven and Kell again.

Mon Mar 13 21:37:30 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Added comic leveL

Tue Feb  7 16:47:10 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.10

Tue Feb  7 16:46:49 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Timestamp debug messages.

Tue Feb  7 16:29:42 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Added some more debug flags.

Tue Feb  7 16:29:10 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Added comic Sequential Art

Tue Feb  7 16:28:27 CST 2006  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * KevenKell changed hosts.

Thu Dec 29 10:32:58 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Starting to update latestpage to the new net/http interface.

Thu Dec  8 11:50:28 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Dominic Deegan no longer maintains an archive page, so we have nothing to scrape.

Sat Nov 26 09:56:25 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.9

Sat Nov 26 09:53:11 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Actually print when I say I'm going to

Thu Nov 24 20:27:17 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * pretend no changes on timeout

Thu Nov 24 20:26:59 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * debug log message added

Wed Nov 16 08:58:03 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Made Wotch's pattern more specific.

Fri Nov  4 10:44:33 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Tweaked packing order, added some stage messages, made zip quiet.

Thu Nov  3 16:28:43 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Tweaking readme

Thu Nov  3 16:19:04 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Replace, not append.

Thu Nov  3 16:17:53 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.8

Wed Nov  2 11:31:56 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Removed include directory hack.  

Wed Nov  2 11:06:18 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * More Comics, Ctrl+Alt+Del

Wed Nov  2 10:54:51 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * API Document additions, debug flags.

Wed Nov  2 10:31:52 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Remove deprecated scrax.base methods

Wed Nov  2 10:31:02 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Touch ups in rakefile.  gem generates rdoc now.

Wed Nov  2 10:30:42 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Touch ups in readme

Wed Nov  2 10:16:55 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * added a rakefile for packaging. Things got too complex for darcs dist.

Tue Nov  1 16:21:35 CST 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * More Debug stuff for people trying to write their own scraxes
  Filled out the Debug flags a bit more.  Added messages in each of the scrapers
  for the :found_page debug flag.  Added a :reset_lastfetch flag so you don't
  have to go delete the state file.
  
  Also, you can set the flags for a single run via environment var.
  e.g.:  env SCRAX_DEBUG=reset_lastfetch,found_page ./yourscraxscript.rb
  The value is just a comma separated list of the flags without the ':'
  
  I still need to add this to the API docs though.
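
  For illustration, reading such a value could look like the sketch below.
  debug_flags is a hypothetical helper; only the SCRAX_DEBUG variable and the
  flag names come from this entry.

```ruby
# Turn "reset_lastfetch,found_page" into [:reset_lastfetch, :found_page].
# A missing or empty variable yields an empty list.
def debug_flags(value)
  value.to_s.split(',').map(&:strip).reject(&:empty?).map(&:to_sym)
end

flags = debug_flags(ENV['SCRAX_DEBUG'])
puts 'found_page debugging is on' if flags.include?(:found_page)
```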

Sat Oct 29 11:29:37 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.7

Sat Oct 29 11:23:37 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Don't clear saved rss unless there is new rss

Sat Oct 29 11:21:43 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Meant to replace if not Array

Sat Oct 29 11:11:23 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Let page url be based from chapter url's dirname.
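
  A small sketch of basing a page url on the chapter url's directory, using
  the standard URI library.  page_url and the example URLs are assumptions.

```ruby
require 'uri'

# Resolve a page href relative to the chapter page it was found on.
# A relative href ends up under the chapter url's directory.
def page_url(chapter_url, href)
  URI.join(chapter_url, href).to_s
end

page_url('http://example.com/comic/chapter3/index.html', 'page07.html')
```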

Sat Oct 29 11:09:07 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Debug tools
  Added a function for printing debug messages, and made a way that these
  can be set via the Scrax.new options.  Only chaptered_anchor_list is using
  it right now.
  It is intended to help people that are making new scrax scripts.  More work
  can be done here.

Sat Oct 29 11:08:27 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * More Comics

Fri Oct 28 14:17:25 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * No New updates to Reman Mythology, going to print now.

Fri Oct 21 14:08:22 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * More Comics

Thu Oct  6 15:17:24 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Comedity now has RSS

Wed Oct  5 20:28:01 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.6

Wed Oct  5 20:23:14 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * DMFA now has an Archive of Chapters to Pages, switch to use that

Wed Oct  5 20:20:56 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Chaptered Anchor list scraper
  This is for pages that have a list of links that point to pages that are
  a list of links.  And it is that second level of pages that you want.
  Most commonly found as a list of Chapters, and then articles within.
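
  The two-level walk can be sketched as below.  The names are hypothetical
  and the anchor matching is deliberately naive; fetch is any callable that
  maps a URL to its HTML, so the sketch runs without a network.

```ruby
# Pull href values out of anchor tags.
def hrefs(html)
  html.scan(/<a\s[^>]*href\s*=\s*["']?([^"'\s>]+)/i).flatten
end

# Walk index page -> chapter pages -> article links.
def chaptered_links(index_url, fetch)
  hrefs(fetch.call(index_url)).flat_map do |chapter_url|
    hrefs(fetch.call(chapter_url))
  end
end

pages = {
  'index' => '<a href="ch1">One</a> <a href="ch2">Two</a>',
  'ch1'   => '<a href="p1.html">1</a><a href="p2.html">2</a>',
  'ch2'   => '<a href="p3.html">3</a>'
}
links = chaptered_links('index', pages.method(:fetch))
```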

Mon Oct  3 15:53:38 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComics

Fri Sep 23 17:27:07 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Two Lumps changed url

Sat Sep  3 13:46:09 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.5

Sat Sep  3 13:45:51 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Forgot a greaterthan

Sat Sep  3 13:37:43 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Switch Base
  I want to deprecate Scrax.base and Scrax.base=, so I better stop using them.

Sat Sep  3 13:18:47 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * More API documentation

Sat Aug 27 18:12:43 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Tweaked the formatting of the readme

Sat Aug 27 18:12:09 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Added custom blazer for The Wotch

Tue Aug 23 12:38:43 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Devil's Panties changed URLs

Thu Aug 18 13:56:51 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.4

Thu Aug 18 13:53:27 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * More API Comments

Thu Aug 18 13:42:15 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Get some pref keys set based on scrax.new params.

Thu Aug 18 13:32:15 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Changing and expanding preferencing
  The start of a new way of dealing with preferences and more importantly
  the various details that are needed to generate useful rss feeds.

Thu Aug 18 13:19:21 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * More API documentation

Thu Aug 18 11:37:29 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Allow Regexp or String as First param

Thu Aug 18 11:34:47 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Touching Up API Documentation
  Reading up on rdoc, and touching up the API docs.  Since I did rather intend
  this to be a usable library, I had better tell people how to use it.

Tue Aug 16 17:44:41 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Better Regexp for optionlist

Tue Aug 16 17:42:57 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Customize paradoxlost's blazer
  The way the option list for Paradox Lost is set up is just weird enough
  that it might as well be customised.   Mostly to get sane id values out
  of the data provided.

Tue Aug 16 10:46:00 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Touch up and add some comments
  Adding better comments to the functions so I can build rdocs later.

Tue Aug 16 10:33:28 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Custom blaze for Fallen
  The way that the archive links are laid out for Fallen is heavily repetitive,
  which keeps the default anchor scraper from quite working.  The links are, 
  however, well organised from a higher level in the page.  So we can use that
  to keep things in line.
  
  That means that this comic needs a custom blazer.

Mon Aug 15 14:16:58 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.3

Mon Aug 15 14:04:18 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * New Anchor List Scraper
  Rewrite of the anchorlist scraper to use htmltokenizer instead of a single
  regexp.  Much more flexible now on how-so-ever the page might encode the anchor
  tags.  Should fix up a few bugs that people have found.
  (Frederic Cote found a bug where pages that had no quotes on the href failed.)

Thu Aug 11 15:13:02 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.2

Thu Aug 11 14:28:22 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComics

Thu Aug 11 14:04:53 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Detect Bad Scraping
  Detect failed scrapes by noting that the rss list after a blaze is empty.
  Then fill rss with a warning item, so users can easily see that something
  needs doing.
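
  A sketch of that check.  Item is a stand-in Struct and with_scrape_warning
  is a made-up name; scrax's real rss classes are not shown in this log.

```ruby
# Hypothetical item type for the example.
Item = Struct.new(:title, :link, :description)

# If the scrape produced nothing, return a single warning item instead,
# so the problem shows up in the feed reader.
def with_scrape_warning(items, page_url)
  return items unless items.empty?
  [Item.new('Scrape failed', page_url,
            "No items found on #{page_url}; the scraper probably needs updating.")]
end

feed_items = with_scrape_warning([], 'http://example.com/archive.html')
```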

Thu Aug 11 13:48:24 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Anchor list regexp needs to be case insensitive

Thu Aug 11 11:31:07 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * RequirementListInReadMe

Thu Aug 11 10:05:44 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  tagged Release-0.1

Thu Aug 11 09:56:52 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Licensing

Thu Aug 11 09:35:30 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * RewroteReadme

Wed Aug 10 17:28:53 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * ExcludeInsteadOfInclude
  It's easier to list which files to not mark as exec instead of adding them
  all the time.

Wed Aug 10 17:25:18 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * CalendarNotCalender

Wed Aug 10 17:23:08 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MiscCleanUps
  Few little touchups in comments and whitespace
  also, since id is a string, pad out numbers with leading zeros.
   Wonder if id should be a number? Or date?
   Need to think about this more, but this works for now.
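
  Why the padding matters, as a small sketch.  pad_id and the width of 6 are
  assumptions for the example, not the values scrax uses.

```ruby
# Pad numeric ids so that string comparison agrees with numeric order.
def pad_id(n, width = 6)
  n.to_s.rjust(width, '0')
end

%w[9 10 100].sort                         # => ["10", "100", "9"]
[9, 10, 100].map { |n| pad_id(n) }.sort   # => ["000009", "000010", "000100"]
```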

Wed Aug 10 11:00:31 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * NewCalendarScrapper
  Gave up trying to build the calendar scraper with regexps.  So now we're using
  a full tokenizer with something of a grammar.  Does a *much* better job.
  
  This does add scrax's first outside requirement: html/htmltokenizer
  ruby gems has it, so just grab it from there.
  

Wed Aug 10 10:59:38 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Respelling

Tue Aug  9 21:00:09 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComics

Fri Aug  5 12:46:11 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * AddSomeLimits

Fri Aug  5 12:44:09 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * limitCleanUps
  tweaked in Comparable, and try for some uniqueness.
  also, seems that for both anchor and option lists, the href/value is a better
  place to be grabbing ids from than the title.  So do that.
  also default limit to 20.

Fri Aug  5 12:11:26 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * FirstHackAtLimits
  Well, this is the first hack at doing limits.  The idea is that there is 
  an ever-increasing id for each entry.  Higher id is newer entry.  This
  will hopefully map across both flat indexes and date based indexes.
  It seems to.
  Things that don't fall to these two types however...
  Lots more work needed here.
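
  The limit idea in sketch form.  Entry and newest are hypothetical names;
  the default of 20 is the value a later change in this log settles on.

```ruby
Entry = Struct.new(:id, :link)

# Keep the `limit` newest entries, newest first; higher id means newer.
def newest(entries, limit = 20)
  entries.sort_by(&:id).reverse.first(limit)
end

entries = [Entry.new('001', 'a.html'), Entry.new('003', 'c.html'), Entry.new('002', 'b.html')]
newest(entries, 2).map(&:link)   # => ["c.html", "b.html"]
```

  The same sort works whether the ids come from a flat index or from dates,
  as long as they compare in the right order.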
  

Fri Aug  5 10:13:48 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * scrapperChanges
  Trying some new code for the keenspace calendar scanning. still has issues,
  but is getting better.
  
  Slight change to the option list scanner.  Should handle more variances.
  

Fri Aug  5 10:13:03 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComics

Sat Jul 30 17:26:57 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComix

Thu Jul 28 15:09:57 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComics

Thu Jul 28 14:20:36 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * DealWithExtraAttributes

Thu Jul 28 13:58:07 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComics

Thu Jul 28 13:53:10 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MoreComics

Thu Jul 28 13:44:49 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * KeenspaceOptionalBase
  Seems the calendars in keenspace archives may or may not have the base part
  of the url, so add that to the regexp.

Thu Jul 28 13:30:11 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * SkipCrapAfterHref
  Needs to still match an anchor if there are other attributes between
  the href and the >
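
  A pattern along these lines tolerates extra attributes around the href.
  It is only an illustration, not the regexp scrax actually uses.

```ruby
# Matches <a ... href="..." ...>text</a>, tolerating attributes on either
# side of the href and quoting with ", ' or nothing at all.
ANCHOR = /<a\s[^>]*?href\s*=\s*["']?([^"'\s>]+)["']?[^>]*>(.*?)<\/a>/im

html = '<a href="001.html" class="nav" target=_top>First</a>'
href, text = html.match(ANCHOR).captures
```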
  

Thu Jul 28 12:19:22 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * LibraryLoadingHack
  So, since I want this to pretty much run out of this dir. (The idea is that
  someone downloads this and keeps it all right wherever it lands.  And it
  just works. (no install)).  But that means that this directory, wherever
  it ends up, needs to be in the search paths for ruby modules.  So before 
  we require scrax, we put the directory of this script into the searched
  paths.  
  Given that I expect most to run this from NNW, and it uses full paths as 
  script names, this works.  (It should work for most relative paths too I
  understand.  Will see I guess.)
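
  The hack amounts to something like this sketch.  The helper name is made
  up; a script could just as well do the two lines inline before its require.

```ruby
# Put the script's own directory at the front of the module search path
# so a sibling scrax.rb is found without installing anything.
def add_script_dir_to_load_path(script = __FILE__)
  dir = File.expand_path(File.dirname(script))
  $LOAD_PATH.unshift(dir) unless $LOAD_PATH.include?(dir)
  dir
end

add_script_dir_to_load_path
# require 'scrax'   # would now resolve to the scrax.rb next to the script
```

  File.expand_path makes a relative script name absolute, which is why this
  also holds up when the script is not started with a full path.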
  

Thu Jul 28 12:18:47 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * ValidRSSTwoDotZero

Thu Jul 28 12:03:44 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * ForgotABrace

Thu Jul 28 12:02:18 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * SomeComics
  So this is just a quick grab bag of some comics.  More will get added later.
  Also added a quick script to +x them since darcs doesn't remember that.

Thu Jul 28 11:53:37 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * Readme

Thu Jul 28 11:25:17 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * AddedOptionlistScraper
  This actually works for two rather different lists. so happy. (evil html)
  may or may not work for others. wheeee.

Thu Jul 28 10:46:07 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * TouchUpcalendarpage
  Cleaned up the date-as-title.

Thu Jul 28 10:44:30 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * StartToDetect
  
  A trap for errors and some comments on what could be added to try and autodetect
  when a page changes enough that the scraper doesn't get what we want anymore.
  

Thu Jul 28 10:04:21 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * MakeSureThingsAreStrings

Thu Jul 28 09:58:56 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * baserefs
  looking like most everything is going to need a baseref specified so put
  in a consistent way of doing that.
  

Thu Jul 28 09:48:04 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * ReuseRSSObject
  Try to reuse the base RSS object if we can.  Keeps the title and uri info
  around easier that way.

Thu Jul 28 09:46:44 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * NowIfNotModified
  Not every page response has the last-modified header filled.  For those that
  don't have it, just use now.
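
  In sketch form, with a plain Hash standing in for the response headers.
  last_modified is a hypothetical helper, not the code in latestpage.

```ruby
require 'time'

# Use the Last-Modified header when present, otherwise fall back to now.
def last_modified(headers, now = Time.now)
  value = headers['last-modified']
  value ? Time.httpdate(value) : now
rescue ArgumentError
  now  # unparsable header: treat it the same as a missing one
end

last_modified('last-modified' => 'Thu, 28 Jul 2005 14:46:44 GMT')
last_modified({})   # no header, so this is the current time
```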
  

Thu Jul 28 09:31:09 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * DateAndBase
  
  Parsing a nil in latestpage was giving a time of now instead of erroring out
  and being the default of '1990'. fixed that.
  
  Also added an option to anchorlist to supply the baseref for links.  This
  might move if other scrapers start using baserefs too.
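
  The nil fix in the first paragraph can be sketched like so.  parse_time
  and DEFAULT_TIME are made-up names; only the default of 1990 comes from
  this entry.

```ruby
require 'time'

DEFAULT_TIME = Time.utc(1990)

# Parse a timestamp string, falling back to the 1990 default when the
# value is nil, blank, or not parsable, instead of silently becoming "now".
def parse_time(str)
  return DEFAULT_TIME if str.nil? || str.strip.empty?
  Time.parse(str)
rescue ArgumentError
  DEFAULT_TIME
end

parse_time(nil)                              # the 1990 default
parse_time('Thu, 28 Jul 2005 09:31:09 GMT')  # the parsed time
```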
  

Thu Jul 28 09:05:31 CDT 2005  Michael Conrad Tadpol Tilstra <tadpol@tadpol.org>
  * InitialImport
  
  So this is the first hack at the base libs for scrax.  Hopefully there is
  enough foundation stuff here to get going.  Am going to need more blaze 
  wrappers for various page types.
  Also, really need the parse error detection stuff I have been thinking of.
  (something that pushes an rss item when it thinks the scraper needs rewriting
  because things didn't quite turn out how it expected.)
  
