Cloud News Without the Noise

Entries filed under “I/O”

RDMA and Storage at a Distance

Over at Forbes, Tom Coughlin writes that RDMA extends the capability of fast direct access to memory between computers in a cluster to greater distances, within a Metropolitan Area Network (MAN ) or even in a Wide Area Network (WAN) that can span continents.

RDMA over a WAN allows some very useful capabilities that can increase the overall power of a clustered computer system. It can provide remote collaboration with a remote file system allowing access as though it were local, enabling apparent real-time collaboration. RDMA also allows very efficient file transfer over a WAN. This direct data placement is accomplished with little impact on the processors on either end of the file transport. These features are very useful for working with large data files such as those common in many HPC applications. Storage at a Distance will not directly impact conventional client computing since these devices typically don’t have access to dedicated high-speed Internet connections. However with the growth of on-line (cloud) services the use of RDMA could accelerate many background processes within a given data center and between data centers. This could improve overall cloud performance and provide services such as fast backups and replications of data to provide data recovery. Thus Storage at a Distance could have a great impact on the overall performance and capabilities available over the Cloud.

Read the Full Story or see Coughlin’s recent Open Fabrics presentation over at inside-Cloud.

Also posted in Cloud, Storage | Leave a comment

Mellanox Reveals a Flexible Alternative to Closed-Code Ethernet Switches

Mellanox introduces an open Ethernet switch initiative designed to give users custom designs and superb return on investment.

The market’s move toward SDN and open source networking offers a variety of advantages that help drive data center productivity and currently is not available with traditional proprietary software,” said Gilad Shainer, vice president of marketing at Mellanox Technologies. “Our demonstration with Quagga highlights the power of Open Ethernet to provide the capability to fully customize open source software packages on top of Mellanox 40 and 56GbE switches, enabling our customers to add differentiation and competitive advantages in their networking infrastructure while reducing cost.”

Read the Full Story.

Also posted in Business of cloud, Cloud in Industry, Hardware, Software, Storage | Leave a comment

CloudSigma Consolidates on SSDs for HPC IaaS

Over at GigaOm, David Meyer writes that European IaaS provider CloudSigma has abandoned magnetic disks for solid-state storage. After a pilot test of SolidFire’s all-SSD storage system, CloudSigma now feels confident enough to offer a service-level agreement for performance, as well as uptime.

According to CloudSigma COO Bernino Lind, the shift to SSD is a major help when it comes to handling HPC workloads, such as those of Helix Nebula users CERN, the European Space Agency (ESA) and the European Molecular Biology Laboratory (EMBL):

They want to go to opex instead of capex, but the problem is there is no-one really who does public infrastructure-as-a-service which works well enough for HPC. There is contention — variable performance on compute power and, even worse, really variable performance on IOPS [Input/Output Operations Per Second]. When you have a lot of I/O operations, then you get all over the spectrum from having a couple of hundred to having 1,000 and it just goes up and down. It means that, once you run a large big data setup, you get iowaits and your entire stack normally just stops and waits.” Lind pointed out that, while aggregated spinning-disk setups will only allow up to 10,000 IOPS, one SSD will allow 100,000-1.5 million IOPS. That mitigates that particular contention problem. “There should be a law that public IaaS shouldn’t run on magnetic disks,” he said. “The customer buys something that works sometimes and doesn’t work other times – it shouldn’t be possible to sell something that has that as a quality.”

Read the Full Story.

Also posted in Business of cloud, Cloud, HPC | Leave a comment

New MIT Software Targets Data-Intensive Cloud Computing

When data-intensive applications meet the cloud, there may be stormy weather ahead.

Cloud computing services undeniably generate a long list of benefits: for example, economies of scale, responsiveness to fluctuating job requirements, in-depth technical support, and the pay-as-you-go scenario come to mind. But researchers at MIT are also aware that applications built around large-scale database queries can cause havoc in the cloud.

Cloud services often partition their servers into virtual machines. Each of these machines is constrained in a number of ways: for example, they may be assigned a finite number of operations per second on the server’s CPU, or allocated a limited amount of space in memory. According to MIT, that makes for easier management of the cloud servers, but it also can result in an allocation of about 20 times more hardware than is necessary to do the job. Naturally the cost of this overprovisioning gets passed on to the customer.

This has prompted MIT researchers to begin work on a new system called DBSeer. According to a recent press release, the software uses machine-learning techniques to build accurate models of performance and resource demands of database-driven applications.

The new algorithm at the heart of DBSeer has been released under an open-source license. Teradata, one of the leaders in the Big Data revolution, is already in the process of importing the algorithm into its solutions.

“With virtual machines, server resources must be allocated according to an application’s peak demand,” explains Barzan Mozafari, one of the MIT researchers. “You’re not going to hit your peak load all the time. So that means that these resources are going to be underutilized most of the time. Provisioning for peak demand is largely guesswork. It’s very counterintuitive, but you might take on certain types of extra load that might help your overall performance. Increased demand means that a database server will store more of its frequently used data in its high-speed memory, which can help it process requests more quickly.

However, a slight increase in demand could cause the system to slow down precipitously – if, for instance, too many requests require modification of the same pieces of data, which need to be updated on multiple servers. “It’s extremely nonlinear,” Mozafari says.

The MIT team has built a DBSeer model of MySQL and they are currently working on a new model for PostgreSQL – both widely used database systems.

Read the Full Story.

Also posted in Cloud | Leave a comment

Slidecast: ScaleOut Software’s In-Memory Data Grids Enable Real-Time Analysis

In this slidecast, CEO Bill Bain from ScaleOut Software presents: In-Memory Data Grids Enable Real-Time Analysis.

ScaleOut Software is a pioneer and leader in data grid software. Since our first products shipped in January 2005, we have consistently developed leading-edge technologies that help our customers solve scalability and performance challenges and gain competitive advantages for their businesses.”

Download the MP3 * Download the Slides (PDF)Subscribe on iTunes * If Dropbox is blocked, download audio from Google Drive.

Also posted in Cloud, Podcasts, Software, Video | Leave a comment

Why pNFS Matters

Ben Woo from analyst firm Neuralytix writes that all NFS v3 users should consider and plan a migration path towards pNFS.

While NFS and other network based file systems have been around for several decades, most of them suffer from being a funnel to all the hosts and clients to which it serves. As capacity and performance needs grow, NFS clients need to be remapped to multiple NFS servers in order to take advantage of additional storage and/or throughput requirements. This becomes extremely challenging in larger environments as NFS clients need to be mapped and remapped across multiple NFS servers in order achieve the desired results. pNFS solves this problem by aggregating multiple NFS servers and presenting this as a singular mountpoint. This solves not only the capacity scaling issue, but also the performance scaling needs of an enterprise. NFS clients no longer need to be mapped or remapped across multiple NFS servers. Instead, they can all mount a singular NFS namespace, and the underlying pNFS storage system will intelligently spread the capacity and load across the available pNFS nodes.

Read the Full Story.

Also posted in Business of cloud, Cloud, Software, Storage | Leave a comment

AWS Adds New EC2 High Storage Instance Family For Data-Intensive Applications

Over at TechCrunch, Alex Williams writes that Amazon Web Services has added a new storage instance for data intensive applications. Designed for applications that require high storage depth and I/O performance, the High Storage Eight Extra Large (hs1.8xlarge) instances includes 120 GiB of RAM, 16 virtual cores (providing 35 ECU of compute performance), and 48 TB of instance storage across 24 hard disk drives capable of delivering up to 2.4 GB per second of I/O performance.

The storage on this instance family is local, and has a lifetime equal to that of the instance. You should think of these instances as building blocks that you can use to build a complete storage system. You should build a degree of redundancy into your storage architecture (e.g. RAID 1, 5, or 6) and you should use a fault-tolerant file system like HDFS or Gluster. Of course, you should also back up your data to Amazon S3 for increased durability.

The new storage instances are applicable to data warehouse applications, log processing and specific applications for verticals such as retail and energy. In related news, the recently announced AWS Data Pipeline service is now available. Read the Full Story.

Also posted in Business of cloud, Cloud, Storage | Leave a comment

Nirvanix Using Violin Flash for “Infinite Elasticity”

Violin Memory has been selected by Nirvanix for its public Cloud Storage Network spanning 10 data centers around the world. Implemented across several multi-petabyte clouds, the including a leading $50 billion financial services company, the Nirvanix–Violin solution has reportedly increased performance for customers by up to 15 times and enabled Nirvanix to expand its business into new markets.

Integrating Violin Memory with our public, private and hybrid cloud solutions expands the opportunities for our customers to build their business on a cloud foundation and opens up business opportunities for us to better serve both new and existing customers,” said Dru Borden, CEO of Nirvanix. “The superior performance of Violin’s primary Flash Storage Arrays enhances the enterprise-class scale, security, data integrity and data accessibility that we provide.”

Nirvanix is essentially a URL with a byte stream payload that is immediately consistent and globally replicated based on policy with infinite elasticity. Violin Memory accelerates I/O performance within this framework. Read the Full Story.

Also posted in Business of cloud, Cloud, Hardware, Storage | Leave a comment

Interview: How the APAX Profiler Accelerates Memory Storage & Bandwidth

Samplify is relatively new company in the HPC market, but their APAX compression technology is already making waves in the market with both software and hardware approaches. To learn more, I caught up with Samplify’s CTO, Al Wegener, author of a new white paper that details how users can apply the APAX Profiler to increase application performance.

insideHPC: What is the APAX white paper about?

Al Wegener: The APAX white paper describes the Memory Wall problem associated with high-performance computing (HPC)), where additional CPU and GPU cores don’t generate faster results because you can’t “feed the beasts” (HPC processors) with operands from memory quickly enough. The paper describes a novel solution (encoding of numerical operands) that results in a measured Memory Wall reduction between 3:1 and 8:1 on HPC application as diverse as multi-physics, climate modeling, and k-means clustering. The APAX encoder works with the APAX Profiler tool to give HPC users new insight into the uncertainty of their input datasets. By encoding operands in software (today) and in memory controller hardware (soon), APAX numerical encoding gives HPC users an adaptive, controllable, and flexible way to reduce DDR, PCIe, Ethernet, Infiniband, and SAS/SATA bottlenecks by 3x to 10x.

insideHPC: What is the APAX Profiler?

Al Wegener: HPC datasets contain both uncertainty and redundancy. While HPC scientists may think their sensor-derived 32-bit or 64-bit data is perfect, typical HPC datasets pick up a lot of noise between the analog sensor and the multi-core CPU or GPU. The APAX Profiler software tool (also available on the Samplify web site) allows HPC users to upload their datasets in order to quantify uncertainties, and to determine the Profiler-recommended APAX encoding operating point that results in “five nines” (0.99999) of correlation between the original dataset and the decoded dataset. For many HPC datasets, “five nines” of decoded quality comes with encoding rates above 3:1, thus reducing the HPC Memory Wall while delivering identical HPC simulation results.

insideHPC: What is the “overcasting” problem and how does APAX help?

Al Wegener: Many HPC simulations, including climate, multi-physics, earthquake, genetic sequencing, and finite element analysis, begin and/or end with real-world sensor measurements. HPC simulations use sensor input to make predictions about the future, but HPC predictions must be compared to the “real world” via subsequent sensor measurements. Sensors generate integer values, but HPC simulations usually use 32-bit and 64-bit floats for computation. “Overcasting” is the tendency in HPC to cast integer values (often with 12 integer bits or less of quality) into floating-point values, without recognizing that the resulting float has been “overcast,” i.e. contains uncertainty that is not reflected in the 32-bit float. The APAX Profiler quantifies the degree of overcasting in HPC datasets by using spectral techniques (FFTs). After recommending an appropriate level of accuracy (uncertainty) for each dataset, the Profiler allows APAX users to fine-tune the accuracy of each dataset while significantly reducing bandwidth and storage requirements.

insideHPC: How is APAX technology a potential enabler for Exascale computing?

Al Wegener: According to the US DARPA Exascale study (2008), Exascale has memory, network, and disk problems, not compute problems. According to DARPA, in order to deliver 1018 flops per second (Exascale), DDR3 memory would have to get 16x faster, while disk drives would have to get 100x faster. By encoding HPC operands (numbers) as they are transferred between multi-core CPU and GPU sockets and DDR, network, and disk drives, APAX reduces the DDR, network, and disk drive bottlenecks of Exascale by user-controllable factors between 3x and 8x.

insideHPC: How does APAX encoding save energy and reduce cloud computing costs?

Al Wegener: Cloud computing depends on cloud-based hardware, but cloud users have to send their data to the cloud and then they have to download the results. By reducing both upload and download costs for users of HPC-on-demand services like Amazon EC2 and Microsoft Azure, APAX saves cloud users both time and money. In addition, experienced cloud users know that CPUs only draw about 40% of server power, while the other 60% is dissipated by DDR memory and disk drives. When APAX reduces DDR and disk bottlenecks, HPC users get their result faster, which reduces cloud-based energy usage. In one memory-bound HPC application, APAX 4:1 encoding resulted in a 3.8x speed-up in “time to results,” and thus a 3.8x reduction in server power consumption.

insideHPC: How is APAX effectively lossless?

Al Wegener: Since sensor samples often comprise the source material for HPC simulations, it’s important to recognize that floating-point numbers are using more bits than required to represent the dynamic range of integer samples. The APAX profiler quantifies the degree to which HPC datasets were overcast and encodes those datasets into “simply the bits that matter.” As APAX beta-testers in HPC climate, multi-physics, and earthquake simulations have verified, their HPC simulation results are identical, but the results come out faster. That’s what Samplify calls “effectively lossless” encoding – the size of HPC input and intermediate datasets are reduced by 3x to 8x, but the results remain the same.

Samplify will demonstrate APAX next week at SC12 booth #4151.

Also posted in Cloud, Software, whitepapers | Leave a comment

Slidecast: APAX Compression Lowers Cost of Cloud

In this video, Samplify CEO Alan Evans presents: APAX: Lowering the Cost of Big Science, Big Data, and Cloud Computing.

Multi-core CPUs are hitting the memory wall,” said Al Wegener, CTO and founder of Samplify. “With each new process node, the number of processor cores on a die can double with Moore’s Law, but the throughput of memory, I/O, and storage fails to keep up with this growth. Hence, the performance of multi-core applications is increasingly memory, I/O, and storage bound. APAX is the only solution that accelerates the throughput DDRx, SAS/SATA, SSD, PCIe, Ethernet, and Infiniband, by up to six times.”

Samplify will demonstrate the APAX profiler and hardware IP at the SC12 conference in booth #4151.

Read the Full Story * Download the MP3 * Download the slides (PDF)Subscribe on iTunes * If Dropbox is blocked, download audio from Google Drive.

Also posted in Business of cloud, Cloud, Compute, Events, Podcasts, Software, Video | Leave a comment

View All Videos