We're soon launching a serverless application hosted on AWS that's heavily dependent on Lambda and Step Functions, but we're having trouble tracking and debugging errors and failures when they occur. We'd like to work with a developer with prior experience customizing CloudWatch to improve application visibility for the dev and QA teams.
Some potential features we'd like to implement:
* CloudWatch dashboard showing the rate/frequency of Step Function failures
* Sending error notifications (via email or Slack)
* Pulling relevant summary data out of failed Step Function executions
* Deduplicating failed executions if caused by the same underlying issue
* Integration with Jira
* Checklist functionality to mark an issue as resolved
About the recuiterMember since Sep 1, 2017 Isabella
from Fukuoka, Japan