vCenter Server rollup jobs and processing performance data
vCenter Server rollup jobs and processing performance data
Determining if vCenter Server rollup jobs are processing performance data (2007388)
Symptoms
- Excessive growth of the vCenter Server database.
- vCenter Server rollup jobs do exist.
- Slow response or timeouts when retrieving performance data.
- Only Real Time data is available when looking at performance data.
- When accessing performance data for a period other than the last 24 hours, you see the message:
Performance data is currently not available for this entity
Resolution
Architecture
- Past day stats rollup – Every 30 minutes
- Past week stats rollup – Every 2 hours
- Past month stats rollup – Every 24 hours
- Whether an Interval Duration is Enabled or Disabled.
- How long the interval between samples is for the statistic level
- This is only available for the first statistic level (5 minute(s) interval in the diagram above).
- How long the information is kept for in the statistic Level
- By default we only keep the statistical data for 1 year, and after that it is purged. It is possible to extend this, and also to keep the data for specific statistic levels for a longer length of time if this granular level of data is needed.
- The statistics level for the different rollup levels
- By default the Statistics Level in vCenter Server is set to Level 1 for each level. This controls the amount of data that is gathered for the level:
- Level 1 – Includes the basic metrics Average Usage for CPU, Memory, Disk and Network, System Uptime, System Heartbeat, and vCenter DRS Metrics. Statistics for devices are not included at this level.
- Level 2 – This level includes all metrics for CPU, Memory, Disk and Network counters (average, summation and latest rollup types – maximum and minimum rollup types are excluded), System Uptime, System Heartbeat and vSphere DRS metrics. Statistics for devices are not included at this level.
- Level 3 – This level includes all metrics (including devices) for all counter groups (average, summation and latest rollup types – maximum and minimum rollup types are excluded).
- Level 4 – This level includes all metrics supported by vCenter Server.
Note: VMware does not recommend setting the Statistics Level higher than Level 2 unless debugging an issue. The amount of data collected is substantially greater and without adequate processing power on the SQL server could cause performance data to be not collected properly.
- By default the Statistics Level in vCenter Server is set to Level 1 for each level. This controls the amount of data that is gathered for the level:
Diagnosis and Resolutions
- If you are running Microsoft SQL (MSSQL), run the command:
exec sp_spaceused vpx_hist_statx
Where x is the statistic table
The output is similar to:
The Rows and Data columns show the amount of data that exists in the database. An acceptable amount is dependant on the size of the environment, but there could be a problem if you are seeing more than 10 million rows in the vpx_hist_stat1 database table.
- For MSSQL, Oracle, and DB2, run the command:
select count(*) from vpx_hist_statx
Where x is the statistic table you are interested in
This query returns the number of rows in the database table specified.
- Statistic rollup jobs do not exist. In an upgrade or recovery scenario, the database (although properly restored), may not include the restoration of the rollup jobs. Validate within MSSQL or Oracle whether the Past Day, Week and Month rollup jobs exist and recreate them if necessary. For more information, see Updating rollup jobs after the error: Performance data is currently not available for this entity (1004382).
- The MSSQL agent service is not started on the Database server. By default when SQL is installed the MSSQL Agent service is started in Manual mode. On reboot the rollup jobs may not run as the service is not started. Validate this configuration by checking the agent services and making sure that the SQL agent service is set to started and automatic.
- Statistic Collection Levels are set too high for the given configuration. If statistic collection levels above level 2 are used, other than for debugging an issue, growth of the database may occur. Reducing to a lower level stabilizes the system, but it may not be possible to recover.
Truncate the non-processed information from the vpx_hist_stat1 table is normally a last resort, but can be the ultimate solution if it is not possible to process the data in an appropriate period of time. To truncate the non-processed information, run these commands:
truncate table vpx_hist_stat1
truncate table vpx_sample_time1
These commands delete the data that has not been processed. The remainder of the historical data prior to experiencing the issue is left intact.