I am starting to migrate our nightly data pipeline from a visual ETL tool to Luigi, and I really like that there is a visualizer for checking the status of tasks. However, I noticed that a few minutes after the last task (named MasterEnd) completes, all nodes disappear from the graph except MasterEnd. This is a bit inconvenient, as I would like to be able to see that everything completed for the current day and for past days.
In addition, if I go directly to the last task's URL in the visualizer, it cannot find any history of that task having run: Couldn't find task MasterEnd(date=2015-09-17, base_url=http://aws.east.com/, log_dir=/home/ubuntu/logs/). I confirmed that it ran successfully this morning.
It should be noted that I have a cron job that runs this pipeline every 15 minutes to check for a file on S3. If the file exists, the pipeline runs; otherwise it stops. I'm not sure whether this is what causes the tasks to be removed from the visualizer. I noticed that each run gets a new PID, but I could not find anything in the documentation about keeping one PID per day.
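To make the cron setup concrete, here is a minimal sketch of the check-then-run logic described above. It is an illustration under stated assumptions, not the actual pipeline code: `run_if_input_present`, `input_exists`, and `run_pipeline` are hypothetical names, and the two callables stand in for the S3 lookup and the Luigi invocation.

```python
from typing import Callable


def run_if_input_present(
    input_exists: Callable[[], bool],
    run_pipeline: Callable[[], None],
) -> bool:
    """Run the pipeline only when the input file is present.

    Both callables are hypothetical stand-ins: `input_exists` would wrap
    the S3 lookup and `run_pipeline` would start the Luigi run.
    Returns True if the pipeline was started, False if this invocation
    exited without doing anything.
    """
    if not input_exists():
        return False
    run_pipeline()
    return True


if __name__ == "__main__":
    # Stand-ins for illustration only; no S3 or Luigi calls are made here.
    started = run_if_input_present(
        lambda: False,
        lambda: print("pipeline started"),
    )
    print("started" if started else "input not found, exiting")
```

Because cron launches a fresh process for every invocation, each 15-minute check gets its own PID regardless of whether the pipeline actually runs.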
So my questions are: is it possible to keep the completed graph for the current day in the visualizer? And is there a way to see what happened in the past?
I appreciate any help.
jpavs