Skip to main content
  • Place orders quickly and easily
  • View orders and track your shipping status
  • Enjoy members-only rewards and discounts
  • Create and access a list of your products
  • Manage your Dell EMC sites, products, and product-level contacts using Company Administration.

Dell Unity: How To Gather The Necessary Information For A Unity Performance Analysis

Summary: This article will show you how to gather Unity Performance Archives (UPA files) for diagnosing performance issues on Unity arrays.

This article may have been automatically translated. If you have any feedback regarding its quality, please let us know using the form at the bottom of this page.

Article Content


Instructions

Issue
  • Poor Read and/or Write performance or high response times for connected hosts.
  • Response times for host applications have increased or are unacceptably high.
  • Performance problems on a Dell Unity array, which requires performance analysis.
Performance issues can have a number of causes including network configuration, congestion, host configuration, hardware faults, disk contention (several busy LUNs sharing the same drives), or the storage array being at its maximum utilization.

Unity Technical Support will not be able to fully investigate and troubleshoot Performance issues until the following have been provided:
  • Service Data File (Data Collect)
  • UPA files (Unity Performance Archives)
  • A detailed description of the symptoms experienced in the customer environment
Failure to promptly provide the information in this LKB can result in longer resolution times.

The details of the performance issues (symptoms, time stamps, etc.) are crucial so that the engineer knows where to focus (times/LUNs/hosts/etc.), rather than spending several hours just to find the issue (not the cause). Without this detail, hours may be spent searching for an issue and we may only be able to possibly only analyze the general performance statistics.
 
Scope
Customer Support Services investigate performance cases to identify issues that are affecting customer sites. Performance tuning (to achieve peak performance for a host or application) and reconfiguration is a task for a Storage Administrator or a Solutions Architect, so this will not be handled in a Break/Fix Service Request.

Service Requests that do not involve an actual problem, such as requesting a report of the current level of workload on an array, are outside the charter of Support. For self-help or those needing help to determine the level of workload on an array can use the LiveOptics service. Please see the following article for detailed information about this option:  
Data And Information Requirements For A Detailed Performance Analysis
For each event, Technical Support needs the following logs and information which cover the incident time frame:
  • Service Data Files (also known as Data Collects), generated shortly after the issue occurred.
    • If Replication is in use, then Data Collects and UPA files from both arrays will be needed.
  • Unity UPA files (Unity Performance Archives), which cover the whole period when the problem was occurring.
  • Host Grabs and Switch logs may also be required if the problem appears to be outside of the array.
  • Unity Network Traces may also be required.

BLOCK ISSUES
  • Provide a clear and detailed problem description:
  • When did/does the problem happen? (Date / Time & Time Zone)
  • Which LUNs are affected?
    • This is very important. We need to narrow down the search.
  • Is Data Reduction or Advanced Deduplication enabled on the LUNs experiencing issues? Is Data Reduction enabled for other LUNs? (Please see the Unity Best Practices White Paper - CPU Utilization section - page 9)
  • Is Replication in use? Please provide information about the RPO for Replication and if the performance issue improves if Replication is paused.
  • Is there or was there at any time active Data Unavailable?
  • Is anything else running at the specified time? (Backup, Jobs, Batches, Virus Scans, Host Disk Optimization (TRIM/UNMAP), etc.)
  • If something is not working as it should, did it ever work? Or is this part of a new implementation/setup? Were there any new applications or hosts added recently?
  • How is performance being measured or what is the impact? Delays or errors on the host side? Are end users reporting slowness?
  • Any recent changes in the SAN environment (hosts, switches, array)?
  • Impact (Affected applications, # of affected servers, # of affected users, etc.)
  • How often does the problem happen? (Constantly / Regularly / One-Time / Randomly)
  • What are the names of the affected hosts?
  • What topology is used? FC, iSCSI or Ethernet? (If using iSCSI all hosts MUST have TCP Delayed ACK disabled - see KB 000079979 and search for TCP Delayed ACK.)
  • If the issue may be around connectivity, we will likely need a topology diagram.
  • Does a known problem exist in the network at the same time or has the network been upgraded recently? Also check that iSCSI is not used on the internal LAN and any switches used must be Enterprise class switches.
 
FILE ISSUES
  • Provide a clear and detailed problem description:
  • What are the specific protocols involved?
    • Example: NFS (version), CIFS (version), NDMP, iSCSI, FTP, etc.
  • Is the issue related to a specific export/share, filesystem/LUN?
  • What time of day does the issue occur? (Date / Time & Time Zone)
  • What is the active workload at the time?
    • Example: login, logout, backup, batch jobs, etc.
  • What hosts are involved?
    • Example: Backup, VMs, workstations, servers, etc.? 
    • Get as much detail on the application(s) as possible.
  • What does the Network configuration look like?
    • Provide specific interface/NIC detail.
  • Are the network interfaces isolated on separate NICs?
    • Example: replication traffic on it's own/single NIC

Collecting Service Data File (Data Collect)

To gather Service Data File (Data Collect), see LKB 000023676. The output files from running Service Data file on the array will be necessary to show the detailed array configuration and any events which occurred on the array which may have had a performance impact. We recommend that you gather the Service Data File (Data Collect) as soon as possible after the event occurs.

  • 000023676: Dell Unity: How to Gather Service Data (Logs) from a Unity Array (User Correctable) https://www.dell.com/support/kbdoc/000079979

Keep in mind that the Service Data file will only contain the last complete UPA file (one hour) and any available -tmp.archive UPA file. Often, this is not enough information for a proper performance review and additional UPAs will be required.

To find the UPA file within the Service Data File (Data Collect), go to spx\cmd_outputs\metrics\. 

  • spx = Primary SP

  • Example: If SPA is the active SP, the UPA will be under spa\cmd_outputs\metrics

 
Collecting UPA Files
  • Support may ask for additional UPA files (versus the UPA available in the Service Data File (Data Collect)).
  • Unlike the Service Data File (Data Collect), the array will store at up to 48 hours of UPA files. 
  • The UPA files will look like the below example. Each file name contains the date and time of the first data point in the file. The interval will be 10 seconds and cover one hour minus 10 seconds. The date and times in the filenames are UTC/GMT.
    • Note that the last UPA file in the example below is the most current archive (-tmp.archive). This contains the current performance data and can be copied from the array (via SSH/CLI only) to look at the most current data. We can use this file to see performance events without needing to wait until the next hour.
Example of UPA files stored on the Dell Unity array:
01/05/2017  10:59 AM        11,017,216 _default_20170105_150000.archive
01/05/2017  11:59 AM        11,017,216 _default_20170105_160000.archive
01/05/2017  01:00 PM        11,017,216 _default_20170105_170000.archive
01/05/2017  01:59 PM        10,983,424 _default_20170105_180000.archive
01/05/2017  02:45 PM         8,308,736 _default_20170105_190000-tmp.archive

 
 
There are two methods (CLI and Unisphere/GUI) that can be used to gather UPA files:
 
Via Unisphere/GUI
Option 1: Unisphere List


This option is only available for Unity OE 4.2 and higher. All available UPAs will not be listed here. There is a limitation to the GUI (Unisphere) and we can only list a certain number of available files. If you do not see the UPA required, you move to Option 2 or use SSH/CLI.


Unisphere List

Option 2: Unisphere Time Range
  • You can select a time range for the system to pull available UPAs.
    • Please confirm the issue time prior to gathering logs so only the necessary logs are downloaded.
  • Keep in mind, the Customer Time range will be in the local browser time and is often the same as the current workstation time.
  • This option is only available for Unity OE 4.2 and higherUnisphere Time Range

Via SSH/CLI

We can also use CLI to access more UPAs that are on the Unity array (but that are not listed in the GUI (Unisphere) per Option 1.

  • Login with the username "service" and the service password.

  • Copy the desired logs to /cores/service/user from /EMC/backend/metricsluna1/archives.

  • Download these logs via WinSCP or a similar tool.

    • If the array is on OE 4.5.X or newer, please change the file protocol to SCP

    • Login with the username 'service' and the service password.

 
ll /EMC/backend/metricsluna1/archives/
cp /EMC/backend/metricsluna1/archives/_default_20230221_180000.archive.gz /cores/service/user

Note: If you are only gathering current UPAs, you may want to |tail-10 as there are dozens of UPA files available on the backend; however, you can list them all out if you wish.

UPAs vis CLI


 
Host Data Collection
Host Grab (or EMC Report) output files may be needed, especially if the issue is unique to specific hosts.

  • Go to https://www.dell.com/support/home/ (Dell Support Site) and search for the Grab Utility for the correct host OS (Example: ESXi host grab)
  • Download the utility and the README file.
  • Follow the README file to run the utility for the host operating system chosen.

 
Switch Logs
If the performance issue only exists on certain paths, switch logs may be required. Please see the following LKB for instructions (based on the model being used):
 

Brocade/Connectrix

  • 000028508: Connectrix Switches: How to collect Switch Logs (Primary) https://www.dell.com/support/kbdoc/000028508
  • 000018926: Connectrix: How to collect a switch log file with a PuTTY session, step by step https://www.dell.com/support/kbdoc/000018926
Cisco
  • 000191330: How Do I collect Tech-Support Logs From A MDS Switch? https://www.dell.com/support/kbdoc/000191300
  • 000010120: Connectrix: How to collect logs from a Cisco MDS 9000 Family Switch using the CLI https://www.dell.com/support/kbdoc/00010120


Network Trace Captures (TCP Dump)
Some File Performances issues may be related to network issues. In order to fully determine the cause of a problem, support may request network traces. Please see the following LKB for instructions:

  • 000022522: Dell EMC Unity: How to gather the necessary information for a Unity performance analysis https://www.dell.com/support/kbdoc/000022522 

Additional Information

Please refer to this video:


Article Properties


Affected Product

Dell EMC Unity Family

Product
Dell Unity 300, Dell EMC Unity 300F, Dell EMC Unity 350F, Dell EMC Unity 400, Dell EMC Unity 400F, Dell EMC Unity 450F, Dell EMC Unity 500, Dell EMC Unity 500F, Dell EMC Unity 550F, Dell EMC Unity 600, Dell EMC Unity 600F, Dell EMC Unity 650F , Dell EMC Unity Family ...
Last Published Date

05 Apr 2023

Version

6

Article Type

How To