Kennedy Changelog
2023-03-19
- Massive improvement to Delorean, making it store a history of cached versions of content, and not just the copy found in the most recent crawl
2023-01-27
- Redesign of crawler code which improved speed of the crawler. Robots.txt files are downloaded ondemand instead of requiring a pre-flight step, ensuring that all capsules with Robots.txt are respected
2022-08-06
- Updated "Page Info" view to support image meta data (dimensions, format, text used in index)
- Updated Delorean to work show cached images and other cached, non-text content
2022-07-26
- Added image search! Images are indexed based on the text in their file path, as well as the text in all their inbound links
2022-06-04
- Updated searched Also include snippet for Gemipedia about the search query and link to Gemipedia entry
2022-03-01
- Added a "Page Info" view that shows title, language, # lines, size of response, and incoming/outbound links to a page
- Improved Delorean by adding a "View Cached" link for each page in the "Page Info" view.
- Streamlined the meta data shown on the search results page into a single line and made it a link to "Page Info" view.
- Improved "title" extraction code to use the first header encountered, regardless of level, or alt text from the first pre-formatted section.
2022-02-21
- Added Delorean which lets you view cached content from most recent scan by providing a URL
2022-02-14
- Added route/view for showing capsules with valid security.txt files