Fast and robust date extraction from web pages, with Python or on the command-line
Htmldate is a Python package that allows users to find the original and updated publication dates of any web page. It offers a range of features including flexible input options, customizable output formats, multilingual support, and compatibility with recent Python versions. The package uses heuristics to sift through HTML markup and text elements to identify dates accurately.