I want to restrict crawler access to my rails device running on Heroku. This would be a straightforward task if I were using Apache OR nginX. Since the application is deployed on Heroku, I'm not sure how to restrict access at the HTTP server level.
I tried using the robots.txt file, but distorting scanners do not comply with robot.txt.
These are the solutions that I am considering:
1) A before_filter in the rails layer to restrict access.
2) Rack-based solution to restrict access
I am wondering if there are any more efficient ways to solve this problem.
source share