Emulex Blog: The Implementer's Blog

Installing or Updating Emulex Drivers on VMware ESXi 5.0

Posted November 22nd, 2011 by Alex Amaya

Most likely, you are not surprised to hear that VMware ESXi 5.0 users no longer have access to a Service Console. You may have also noticed that there are several new features and changes. One change is the install procedure to manually update or install Emulex drivers. Most of the Emulex drivers are inbox drivers and will need to be updated whenever a new version is released. I’d like to share the process for updating your Emulex drivers in this blog post. Other options you may wish to consider are auto deploy, or using the VSphere Management Assistant (vMA) appliance.
Here are the steps to updating your drivers:

  1. Login with your VMware vSphere Client to vCenter Server.
  2. Select the host you want to update or install new drivers.
  3. Go into Tech Support Mode to enable SSH. It is a simple task to perform: Highlight the host-> select Configuration Tab -> then select Security Profile from the software table of contents. Highlight TSM-SSH then Properties. Once you enable SSH, a warning symbol will appear to let you know your host is no longer secure. See VMware KB 1016205 & 2003637.
  4. From your Windows or Linux client, download the Emulex driver for the adapter and store it in a temporary location.
  5. From your Windows or Linux client, run a program such as WinSCP for Windows and move the driver you downloaded from VMware’s website to the ESXi host. I prefer to place the Emulex driver in the /var/log/vmware directory.
  6. Next, SSH into the ESXi 5 host by using a tool called putty.exe
  7. Once logged in, run the following commands to install the driver:
    1. # esxcli software vib install –no-sig-check –maintenance-mode -d
    2. Example: #esxcli software vib install –no-sig-check –maintenance-mode –d Emulex-FCoE-FC-lpfc829-8.2.3.108.36-offline-bundle.zip
  8. Reboot the host to activate the new or updated driver
  9. If for some reason you need to remove the driver, execute the following esxcli command: # esxcli software vib remove –n –f

We hope this helped you learn how to install or update Emulex drivers on VMware ESXi 5.0. If you have any feedback or questions, comment here or contact Emulex Technical Support.

How to optimize your SAN resource utilization and save $200K per year!

Posted November 1st, 2011 by Erick Crowell

Administrators are seeking creative ways to get more from their existing infrastructure. A recent survey of CIOs reveals what many already know; IT budgets are stagnant or shrinking. At a time of explosive growth and increased demand for performance, organizations are being pushed to innovate to survive. Given the limited ability to grow, administrators look to optimize existing resources, squeezing out performance, to help them to meet demand.

One strategy involves auditing to find unused or underutilized SAN-attached storage, which is something Emulex OneCommand Vision does (which we announced version 2.0 of today at SNW Europe, check out our announcement here). Inactivity on a LUN, for example, is an indication that an applications demand for storage may be changing. Each SAN-attached LUN represents a ‘chunk’ of infrastructure dedicated to a particular compute resource, such as a server. As the demand for SAN-attached resources rises, the opportunity cost of letting underutilized resources remain in place rises.

Auditing for underutilized resources at the current storage tier allows administrators to reprovision costly infrastructure, moving resources to alternative storage tiers or retiring them altogether. Repurposing allows organizations to avoid or defer the costs to acquire additional capacity. Reclaiming as little as two percent of the storage infrastructure can save nearly $200,000 per year for a mid-sized SAN deployment.

Let’s consider the numbers. What does it cost to provision SAN-attached storage to your application or database server? To arrive at an answer we chose ‘street prices’ for equipment typically found in a ‘mid-sized’ SAN deployment (500 servers).

The general costs are:

  • Plumbing the storage network between the server and storage array (network adapter, multi-tier network/fabric), about $8K for two redundant paths
  • Fault-tolerant SAN-attached 4 TB LUN, approx. $8K
  • Storage Management software, approx. $4K

That’s about $20K to connect that super fast, highly available storage to each of your application or database servers.

Multiply that across all 500 SAN-attached servers in this ‘mid-sized’ environment and the total SAN infrastructure cost totals about $10 Million. Organizations that can repurpose just 2% of the previously under-utilized infrastructure can save about $200K a year, in deferred infrastructure costs.

These innovative organizations look for this kind of ‘low-hanging fruit’ as they struggling to maximize their limit budgets. What strategies have your organization tried? Please take a moment and share your experience in the comments below!


¹New C4 Agenda: Perspectives and Trends from State Government IT Leaders, sponsored by TechAmerica, the National Association of State Chief Information Officers (NASCIO), and Grant Thornton LLP.

16GFC: Much more than a speed bump

Posted October 10th, 2011 by Mark Jones

In preparation of the release of our latest Host Bus Adapter (HBA) – the LightPulse® 16Gb Fibre Channel (16GFC) – we took a look at its performance and were pretty amazed by the results compared to our previous gen product. The LPe16002 is of course capable of running 16GFC, so you would expect performance to be about double that of 8GFC, but then again, when you dig into the details of the 16GFC spec, you may be disappointed to find out that your data bits aren’t actually flying over the wire at double the speed compared to 8GFC. 16GFC actually runs at 14.025Gbp/s baud rate where 8GFC runs at 8.5 so it’s slower right? Wrong! The designers of the specification did a clever thing when they came up with 16GFC. All previous speeds used a 8b/10b encoding scheme, meaning that for every 10 bits flying over the wire, 8 of them are data and 2 are used to make sure the data is correct, so only 80% of the bits are your data. For 16GFC, they changed the encoding to a much more efficient 64b/66b scheme, so much less of the bits are wasted for coherency, and a bigger chunk of it is your data. So the bottom line is that 16GFC link rate delivers twice the data deliver over 8GFC.

But saying the new LPe16002 HBA can deliver twice the performance over 8GFC HBAs is the expectation, but there is much more to the story. So sure, as you can see in figure 1, the 16GFC LPe16002 HBA is capable of 1576MB/s compared to 789MB/s for our 8GFC LPe12002 HBA, almost exactly double. The LPe16002 is the first HBA with an 8-core processor and can deliver performance that is actually 5x that of previous adapters.

Figure 1.

Max I/O per second (IOPS) for a single port of the LPe16002 is just over 1 million! This is one area where the 8-core processor shines, it has the ability to crank out millions of I/O processing instructions when the data size isn’t too burdensome. At this small data block size that we measure, this kind of performance is all about the HBA processor performance and not the link rate, this HBA would be capable of 1 million IOPS even at 8GFC…..that’s right, you can get this kind of IOPS without having to buy a new 16GFC switch.

Figure 2.

Ok, I know what you’re thinking….”My application doesn’t use 512 byte data blocks”. Good point, the fact is that the highest transaction applications actually use mostly 4k or 8k data blocks and that area is another measure of the muscle that the LPe16002 has. Most HBAs cannot reach the link rate maximum until the data blocks get very large, usually around 16k data block sizes, so I/O transactions for database applications can be limited by the HBA processor and can’t get close to the Fibre Channel link rate ceiling. This is not the case with the LPe16002, because its massive power allows for near link rate performance at 4k and 8k data blocks.

Like previous generations of Emulex HBAs, they are backwards compatible and leverage a common driver stack. This means that you can buy a LPe16000 HBA today and plug it into your current server alongside our previous products and connect the card into your current 4 or 8GFC switch. You will not be able to utilize the full size of a 16GFC link until you upgrade your switch, but that doesn’t mean you won’t see other massive performance benefits like lower I/O response times and higher transaction rates. To see proof of this and a full performance demo, see the LPe16000 performance demonstration video.

Figure 3.

Sometimes you need to see the protocol conversation to understand and solve SAN performance issues

Posted October 5th, 2011 by Erick Crowell

This is the second installment in a series of blogs that will discuss SAN performance monitoring and troubleshooting.

Consider this situation faced by one of our ‘large financial’ customers with a complex SAN environment running critical trading applications. This customer had been experiencing periodic performance problems with a Windows cluster running a critical business application. In a nutshell, application I/O was taking too long, internal timers would expire and the application would shut down.

After each occurrence, administrators would quickly stabilize the application, balancing the need to collect information about the issue with the requirement to minimize the amount of money the company was losing. Many trouble tickets where opened and symptoms were well-understood, but a root cause could not be found using the tools available to the team.

Eventually, vendors were asked to help prove that their equipment was not at fault. One by one, each vendor used their own management applications to demonstrate that no faults existed in their equipment. Finally, under the guidance of Emulex Technical Support, the customer enabled extended logging on the servers experiencing the slowdown. This extended logging allowed for the typically hidden Extended Link Service (ELS) and SCSI protocol events captured by the Emulex adapters to be collected in a LOG file. These low-level events are processed by the adapters and driver software but are not reported and are certainly not available through any native OS API or event interface.

After a brief review of the protocol events collected, the teams identified an unexpected pattern: the storage target was repeatedly sending Port Logout (LOGO) commands to each server in the Windows cluster. This LOGO command would cause all outstanding I/O operations to be cancelled, which would require each server to login (again) and re-send outstanding I/O. Although some number of I/O would complete, the net effect was to dramatically slow the overall performance, pushing the application past an internal limit.

After some searching using similar methods, the customer was able to identify a separate Linux machine, connected to the same target array, that was periodically sending ‘Target Reset’ commands to the storage array. Each time the storage array received one of these ‘reset’ commands it would, in-turn force the ‘Logout’ of all attached initiators. In the end, the customer was able to isolate the Linux server, thereby removing the source of the problem and resolving the application performance issue.

So why couldn’t other tools already in the environment detect this problem? To put it simply, from the perspective of device and SAN management tools, there was no problem. Storage links remained active, devices along the storage path were on-line with plenty of capacity, and I/O that made it to back-end storage devices was returned in a timely fashion. None of these tools monitor the end-to-end protocol conversation, which explains why these applications missed the problem in the first place.

Tools designed to collect, consolidate and analyze the information from each of the initiators in your environment help reveal hidden performance and health problems that other applications miss. Emulex offers this power in a tool we call OneCommand Vision. I invite you to cruise over to our landing page to learn more.

Dude, do you know how to install Emulex ESXi 4.1 drivers?

Posted October 4th, 2011 by Alex Amaya

Have you seen the movie called Dude, Where’s My Car? I laugh every time I see the scene where they go back and forth on their tattoos:

“We got tattoos!”
“What does mine say, Sweet!” … “What does mine say, “Dude!”
“What does mine say, Sweet!” … “What does mine say, Dude!”

…And it goes on and on until they get tired of each other and fight about it. Well, it has nothing to do with this blog, but Dude! You do need to know where and how to install Emulex ESXi 4.1 drivers.

From time to time, I hear from Emulex customers and partners who need to know how to either install or update Emulex drivers for the LightPulse Fibre Channel Host Bus Adapters (HBAs) or OneConnect 10Gb Ethernet (10GbE) Universal Converged Network Adapters (UCNAs). In response to these questions, I’ve created a new Application Note that takes you through the process of installing the Emulex Fibre Channel ESXi drivers using the vSphere Command-line interface (vCLI).

After reading this app note, if you still have questions on installing or updating our drivers with VMware ESX, please post a comment to this blog, or contact Emulex Technical Support, or send an email to me here at implementerslab@emulex.com.

VMworld 2011 – High Performance Demonstrations at the Emulex Booth

Posted August 29th, 2011 by Mark Jones

Red Ducati Giveaway VMworld 2011If you’re going to be at VMworld, be sure to stop by the Emulex booth at space #326. We don’t have the biggest booth, but we will be easy to find. Just look for the huge crowd gathered around trying for a chance to win a new Ducati 696 Monster motorcycle. As you can see from the picture we had a good time with the bike before the show…just kidding, geeks like us can’t ride a bike like that! But we are pretty serious about performance of networks and storage and we will have a number of product demonstrations that will be almost as exciting as the Ducati.

  • 16Gb/s Fibre Channel (FC): 16Gb FC makes VMware ESX 5 run so fast that it makes a Ducati look like a scooter. We have a technology demonstration comparing the performance of 16Gb FC, running on our new XE201 I/O Controller, to 8Gb FC. While it’s not a released or supported configuration, yet, you will be surprised at just how easy it is for the Emulex Lightpulse® architecture to make this work.
  • iSCSI over Data Center Bridging (DCB): This new feature allows Emulex OneConnect™ iSCSI adapters to work over a 10GbE converged network to reduce latency and optimize bandwidth allocation and performance. We will demonstrate how to configure an end-to-end lossless iSCSI storage connection between the Emulex OCe11102-I adapter and a Dell EqualLogic PS6010 iSCSI storage array, using the new v5.1 firmware, which supports lossless iSCSI traffic.
  • Universal Multi-Channel (UMC): This feature, is supported with Emulex OneConnect OCe11102 10Gb Ethernet (10GbE) Universal Converged Network Adapters (UCNAs). With UMC, OCe11102 dual-port UNCAs present eight unique PCI functions to the VMware ESXi™ hypervisor that can be assigned to virtual machine (VM) applications, VM migration and VM management. The demonstration will show how to set up UMC, manage bandwidth allocation and configure functions in a VMware environment.
  • OneCommandTM Manager for VMware vCenter™ Server v1.1 This latest version of OneCommand Manager plug-in for VMware vCenter Server will be used to manage all the HBA and UCNAs installed in the demos. This is a must see demonstration, you will be amazed how flexible and comprehensive this tool is to use. And the best part is that it is free!
  • OneCommand Vision: Emulex OneCommand Vision is an I/O management tool that proactively monitors I/O performance and availability within physical, virtual and cloud data center environments. Emulex OneCommand Vision is the I/O management solution for VMware environments that monitors end-to-end performance, including the ability to proactively detect hidden storage-side contentions that are adversely impacting application performance. This demonstration will showcase this and how OneCommand Vision enables organizations to evaluate, monitor, diagnose and improve application I/O performance.

Ask to see our demos at booth #326. You can’t miss us, look for the red Ducati doing a wheelie!

Having trouble diagnosing those hard to solve I/O performance issues? Let your initiators do the heavy lifting.

Posted August 24th, 2011 by Erick Crowell


This is the first installment in a series of blogs that will discuss SAN performance monitoring and troubleshooting.

It was a typical crisis in the data center. Another application slowdown has the team working late nights, and working with vendors to determine whose equipment is causing problems. Application performance monitoring says the servers have CPU and memory to spare. Storage Resource Monitoring (SRM) tools are telling you there are loads of capacity and bandwidth, but the application is still unresponsive and the trouble tickets are pouring in.

Sound familiar? In spite of the numerous tools available to administrators today, often understanding and overcoming application I/O performance problems requires a deeper understanding of the protocol conversations occurring between devices in the storage network.

Traditional tools leave administrators with I/O ‘blind spots’. Diagnosing problems in these blind spots forces administrators to start searching for clues in hard to reach places. This search often involves adjusting driver settings, attaching ‘taps’ to capture traffic, contacting equipment vendors, and re-educating your team on the finer details of Storage protocols.

There are the obvious ‘capacity’ related slow-downs which occur when interconnect or storage equipment is overloaded. In these cases, the adapters, switches, or storage arrays are not able to handle the load. These types of ‘physical’ limits can significantly affect I/O performance but are often easily resolved by procuring more capacity or redistributing the I/O load. To avoid them, most organizations deploy tools that monitor and report when certain physical thresholds have been exceeded before they become problems.

Lesser known are the many ‘soft’ performance issues caused by misbehaving or misconfigured infrastructure attached to the SAN. These ‘harder to detect’ problems can silently impact the performance of other devices (servers) sharing the same infrastructure. Too often, the first signs of trouble are alerts sent from application monitoring tools and users. Diagnosis of these issues often occurs during ‘downtime’ events and little to no proactive detection techniques are available.

But there is good news here, these types of soft problems can be understood using the intelligence built-in to every one of your SAN-attached server. This intelligence is often hidden from users but when exposed and understood, it can be extremely helpful in diagnosing tricky interaction problems.

It’s a fundamental part of the architecture of storage networking protocols (like SCSI, Fibre Channel and Fibre Channel over Ethernet). As the initiator of an I/O transaction, the server (or adapter within it) can expect that all other devices must make every effort to push warnings and failures back to it. Generally speaking, these events can be classified into four categories; Extended Link Service (ELS), Fabric, SCSI Protocol, and physical hardware (on the initiator). Each deal with a separate layer of the conversation and all are important to understanding the ‘end-to-end’ conversation.

In a sense, every initiator, when properly configured, can serve as a probe for its own I/O traffic. By looking down the path to the storage it’s interacting with, each initiator can be used to construct a detailed picture of the health of the devices involved in the conversation. Taken together, this initiator view combined from multiple servers can be used to form a ‘whole SAN’ view of I/O performance and health.

In the next installment, we will talk about interpreting the SCSI conversation taking place in your SAN and using that to give you clues to potential problems brewing. So watch this space for more. And, as always, let us know what you think.