So, I have been using the application engine for a long time without any problems. I know that if the application has not been affected by the visitor for some time, the instance will be closed, and the first visitor to the site will have a delay of several seconds until the new instance begins.
However, recently it seems that instances remain alive for a very short period of time (sometimes less than a minute), and if I already have one instance and I refresh the application’s web page, it still fires another instance (and the start the page is the minimum HTML start page, does not require a lot of CPU / memory). Looking at my magazines, he constantly launches new copies, which has never happened before.
Any tips on what I should look at, or any ideas on why this is happening?
In addition, I use Python 2.7, threadsafe, python_precompiled, inbound warm-up services, NDB.
Update:
So, I modified my application to have at least one standby instance, hoping this solves the problem, but it still starts new instances, even if one resident instance is already running. Therefore, when there is only one resident instance (and I do not receive any traffic except me), and I go to another page of my application, it still starts a new instance.
Also, I changed the expected delay to 1.5 seconds, as the coma noted, but that doesn't seem to help.
The use of memory instances is always around 53 MB, which is surprising when the pages being called don't do much. I use the F1 Frontend Instance Class and has a limit of 128, but in any case 53MB seems high for what it should do. Is this an acceptable size on first start?
Update 2: I just noticed on the toolbar that over the past 14 hours, the request / _ah / warmup has answered 24,404 with errors. Could this be related? Why would they answer 404 response status?
The main question: Why does it constantly launch new instances (even without traffic)? Especially when there are already existing instances and why do they close so quickly?