Week 46 Changelog
Added
- Backend: authors are now retained in the `hnbooks_ranks` weekly index documents instead of being stripped out, by indexing the original `entry` payload directly. This sets things up for future features that need weekly author data.
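
As a rough illustration of the Backend change above, the sketch below contrasts the two ways of preparing a weekly document. The function names and the sample `entry` are hypothetical, and the actual Elasticsearch call is left out; only the `authors` field, the shallow copy, and the idea of sending `entry` as-is come from this changelog.

```python
def build_weekly_doc_before(entry: dict) -> dict:
    """Earlier behaviour as described: shallow copy, then drop authors."""
    doc = dict(entry)          # shallow copy so the caller's entry is untouched
    doc.pop("authors", None)   # authors were stripped before indexing
    return doc


def build_weekly_doc_after(entry: dict) -> dict:
    """Current behaviour as described: the entry itself is the document."""
    return entry


# Hypothetical sample entry, for illustration only.
entry = {"title": "Example Book", "rank": 1, "authors": ["A. Author"]}

before = build_weekly_doc_before(entry)
after = build_weekly_doc_after(entry)
```

Here `before` has no `authors` key, while `after` is the original `entry` and keeps it, which is what makes weekly author data available downstream.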
Changed
- Crawler author handling: Google Books and Open Library imports now both write to a single `authors` field instead of maintaining separate `gb_authors`/`ol_authors` variables, simplifying how authors are stored.
- Weekly ranks indexing: the weekly indexer no longer builds a shallow `doc` copy or removes `authors` before indexing; it sends the full `entry` through to Elasticsearch.
- Top-N generation logic: when building the weekly top list, the crawler now calls `get_book_info(url_entry)` before checking for a missing `store` field, and only overwrites `book_info` for carry-over entries that don’t have `store` set (i.e., items reused from last week).
- Newsletter CTA copy: the “💬 Tell Us What You Think” section is now “💬 What Did You Read This Week?”, asking readers to share their reading log so you can feature a reader-suggested book.
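
The reordered Top-N flow might look roughly like the sketch below. This is a guess at the shape, not the crawler's code: `resolve_book_info`, the `last_week` mapping, the `url` key, and the stub body of `get_book_info` are all hypothetical, and the changelog does not say what `book_info` is overwritten with, so falling back to last week's record is an assumption.

```python
def get_book_info(url_entry: dict) -> dict:
    """Hypothetical stand-in; the real function presumably performs a lookup."""
    return {"url": url_entry["url"], "source": "fresh"}


def resolve_book_info(url_entry: dict, last_week: dict) -> dict:
    # Step 1: the lookup now runs first, for every entry.
    book_info = get_book_info(url_entry)

    # Step 2: only entries without `store` (carry-overs) are overwritten.
    if not url_entry.get("store"):
        # Assumption: the replacement is last week's record for the same URL.
        book_info = last_week.get(url_entry["url"], book_info)

    return book_info


# Hypothetical data, for illustration only.
last_week = {
    "https://example.com/a": {"url": "https://example.com/a", "source": "last_week"},
}

fresh = resolve_book_info({"url": "https://example.com/b", "store": "some-store"}, last_week)
carried = resolve_book_info({"url": "https://example.com/a"}, last_week)
```

In this sketch an entry with `store` set keeps the freshly fetched `book_info`, while a carry-over entry without `store` picks up last week's record.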
Removed
- Sample ES doc: removed `hnb-book-crawler/crawling/doc.json`, which contained a hard-coded example document and is no longer needed.
- Frontend author display: removed the `.book-authors` CSS block and the author byline markup from both `book.html` (book detail page) and `books.html` (weekly list cards), along with the `authors` normalization helper in `es_books.py`. Authors continue to exist in the data, but they’re no longer rendered on the site UI for now.
Next up: Book descriptions (hopefully)