Health Check list Netapp

NetApp Health Check Checklist
=============================

1. Check the messages file in the /etc directory for any reported errors and warnings.

# rdfile /etc/messages (from the filer console), or open \\filer\etc$\messages from a Windows host
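Reading a long messages file by eye is error-prone, so a small filter can help. The following is a minimal Python sketch, not part of the original checklist. It assumes the messages file has been copied to a workstation and that problem entries contain the words "error" or "warning" somewhere in the line; the function name and keyword list are hypothetical and can be adjusted.

```python
def find_problem_lines(lines, keywords=("error", "warning")):
    """Return the lines that mention any keyword, ignoring case.

    `lines` is any iterable of strings, for example an open file handle
    on a local copy of /etc/messages. The keyword list is an assumption.
    """
    lowered = tuple(keyword.lower() for keyword in keywords)
    return [
        line.rstrip("\n")
        for line in lines
        if any(keyword in line.lower() for keyword in lowered)
    ]
```

To use it, open the copied file and pass the handle to `find_problem_lines`, then review each returned line manually. A plain substring match will also catch harmless lines that merely mention the word "error", so treat the result as a shortlist.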

2. Check for failed disks. If any are found, open a case with NetApp, providing the required details (RPM, size, and type) along with the serial number.

# vol status -f

3. Check the volume and snapshot space utilization.

Make sure all volumes are below 90% space utilization and that the .snapshot space is not above 100% utilized. If a volume is over-utilized while its snapshot space is below 100% utilized, send mail to the help desk asking for unwanted data to be deleted to free up space.

If the snapshot space is above 100% utilized, ask for some old, unwanted snapshots to be deleted.

If space cannot be freed in the above cases, check the free space in the existing aggregate to plan a space addition.

# df -h

# snap list volume

# aggr show_space -g aggrname
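The thresholds in step 3 can be applied mechanically to saved `df -h` output. The following is a rough Python sketch under stated assumptions: each data row starts with a `/vol/` path and contains one field ending in `%` (the capacity column), and snapshot rows have a path ending in `.snapshot`. The function name and default limits are illustrative; verify the column layout against the output of your own filer before relying on it.

```python
def flag_full_volumes(df_output, vol_limit=90, snap_limit=100):
    """Return (path, percent_used) pairs that break the step 3 thresholds.

    Volumes are flagged at or above vol_limit; .snapshot rows are flagged
    only when they are above snap_limit.
    """
    flagged = []
    for line in df_output.splitlines():
        fields = line.split()
        if not fields or not fields[0].startswith("/vol/"):
            continue  # skip the header row, blank lines, and anything unexpected
        # Assumed layout: the capacity column is the first later field ending in '%'.
        percent = next((f for f in fields[1:] if f.endswith("%")), None)
        if percent is None or not percent[:-1].isdigit():
            continue
        used = int(percent[:-1])
        is_snapshot = fields[0].rstrip("/").endswith(".snapshot")
        if is_snapshot and used > snap_limit:
            flagged.append((fields[0], used))
        elif not is_snapshot and used >= vol_limit:
            flagged.append((fields[0], used))
    return flagged
```

Pass the captured text of `df -h` to `flag_full_volumes`; an empty list means no volume or snapshot reserve crossed its limit. For any path that is returned, follow the escalation described in step 3.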

4. Check the SnapMirror status and act accordingly if the status is anything other than idle with an acceptable timestamp.

# snapmirror status

5. Check the storage system environment status for any shelf or fan errors.

# environment status shelf

6. Check aggregate utilization to ensure sufficient space is available for future growth.

# df -Ah

7. Check the network interface status and make sure that no interfaces in a vif are shown as broken.

# ifconfig -a

# vif status

8. Check the CIFS status for any broken AD connections.

# cifs domaininfo

9. Check the virus scan report to confirm that the scan engine is active and running fine and that no scan failures are reported.

# vscan scanners

10. Check the initiator status to confirm that all hosts, and in particular all hosts that have LUNs assigned, have an active connection to the storage filer.

# lun show -m

# igroup show -v

11. If SnapVault is configured on the filer, check its status to confirm that the vault backups are up to date.

# snapvault status

12. Check that the FCP and iSCSI services are running, if configured.

# iscsi status

# fcp status

13. Check the cluster status to confirm that high availability is enabled without errors.

# cf status

# cf monitor
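Running the same commands by hand every day invites omissions, so the checks could be collected in one pass. The following is a hedged Python sketch, not part of the original checklist. It assumes SSH access to the filer is enabled with key-based login; the `root` user, the function names, and the 60-second timeout are hypothetical choices. Commands that need a volume or aggregate name (`snap list`, `aggr show_space`) and the messages file read are left out on purpose.

```python
import subprocess

# Argument-free commands from the checklist above.
CHECK_COMMANDS = [
    "vol status -f",
    "df -h",
    "df -Ah",
    "snapmirror status",
    "environment status shelf",
    "ifconfig -a",
    "vif status",
    "cifs domaininfo",
    "vscan scanners",
    "lun show -m",
    "igroup show -v",
    "snapvault status",
    "iscsi status",
    "fcp status",
    "cf status",
    "cf monitor",
]


def build_ssh_argv(filer, command, user="root"):
    """Build the argument list for running one command over SSH."""
    return ["ssh", f"{user}@{filer}", command]


def run_checks(filer, user="root", runner=subprocess.run):
    """Run every check and return a dict of command -> captured stdout.

    `runner` defaults to subprocess.run and can be swapped for a stub
    when testing without a real filer.
    """
    results = {}
    for command in CHECK_COMMANDS:
        completed = runner(
            build_ssh_argv(filer, command, user),
            capture_output=True,
            text=True,
            timeout=60,
        )
        results[command] = completed.stdout
    return results
```

Calling `run_checks` with the filer's hostname returns the raw output of each command keyed by the command string, which can then be saved or reviewed step by step. The sketch only gathers output; interpreting it still follows the criteria given in each step.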