It would be helpful for operations to have a page that allowed you to view all of the errors within a given window, say 24 hours as default, for an environment. Something like the linux equiv of:
cat errors | sort | uniq -c | sort -nr
This could allow for a quick view into an environment to see what are the major problems plaguing the users, be it mundane syntax errors, IO failures, etc.
Currently, having to browse wu's, get all failed state jobs, individually click on each one, expand the error message, go to the next failed job, and repeat to slowly get an idea if their is a real problem or not, takes a lot of time.