I have a large list of files, some of which have dates embedded in the file name. The date format is inconsistent and often incomplete, for example. "Aug06", "Aug2006", "August 2006", "08-06", "01-08-06", "2006", "011004", etc. In addition, some file names have unrelated numbers that look like dates, such as โ20202010โ.
In short, dates are usually incomplete, sometimes do not exist, are inconsistently formatted, and are embedded in a string with other information, for example. "Report Aug06.xls".
Are there any Perl modules available that will do a decent job of guessing the date from such a string? It should not be 100% correct, because it will be checked by a person manually, but I try to make everything as simple as possible for this person, and there are thousands of records to check :)
date perl nlp
El yobo
source share