Recently we started getting bombed with these errors and until now I still can’t fully solve these. I’ve ran all the SQL cleanup scripts and tuned noisy monitors etc, but these errors persist against both MS.
Alert: Data Warehouse event data dedicated maintenance process failed to perform maintenance operation
Alert: Performance data collection process unable to store data in the Data Warehouse in a timely manner
Alert: Event data collection process unable to store data in the Data Warehouse in a timely manner
Alert: Object Health State data collection process unable to store data in the Data Warehouse in a timely manner
Alert: Data Warehouse configuration synchronization process failed to write data
I’ve watched the SQL Profiler, the SQL Activity Monitor, the SCOM SQL Database Summery Dashboards, SCOM Management Data Access > Console and Data, etc.
In all the perf data I can’t identify where the bottlenecks are even though I can see some spikes, delays and the alert notification balloons in the graph. I find no great latencies or bandwidth issues.
I’m really getting to wits end on how to solve this. There is no particular time of day for these.
The LAST thing I saw as a recommendation is to adjust the “Data Warehouse Relationship Synchronization Data Writer Recovery State” monitor from the default “max allowed recovery time (seconds) from 300 to 600. This doesn’t seem to have made a difference, and there are many of the recovery state timers I could OR to the 600 second value. Should I be more comprehensive in these?
I’ve read all the articles I could find by Kevin Holman, Cameron Fuller, Bob Cornellissen etc.
Has anyone else been through this? Any ideas of what else to tune? Any other tests or benchmarks?