<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Monitoring NetApp with Nagios and Nagiosgraph</title>
	<atom:link href="http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/feed/" rel="self" type="application/rss+xml" />
	<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/</link>
	<description>My digital diary with topics about family, work, computers, technology, books and whatever else comes to mind</description>
	<lastBuildDate>Sun, 12 Jul 2009 03:05:58 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: links for 2009-07-11 &#124; Amano Jyaku.Info</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-604</link>
		<dc:creator>links for 2009-07-11 &#124; Amano Jyaku.Info</dc:creator>
		<pubDate>Sun, 12 Jul 2009 03:05:58 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-604</guid>
		<description>[...] Monitoring NetApp with Nagios and Nagiosgraph « Nerhood Weblog (tags: nagios monitoring netapp) [...]</description>
		<content:encoded><![CDATA[<p>[...] Monitoring NetApp with Nagios and Nagiosgraph « Nerhood Weblog (tags: nagios monitoring netapp) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: morbid</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-601</link>
		<dc:creator>morbid</dc:creator>
		<pubDate>Thu, 28 May 2009 19:55:48 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-601</guid>
		<description>any idea how to fix latency?
I&#039;ve removed

# # Just in case of problems, let&#039;s not hang Nagios  
# $SIG{&#039;ALRM&#039;} = sub {  
#         print &quot;ERROR: No snmp response from $hostname (alarm timeout)\n&quot;;  
#         exit $ERRORS{&#039;UNKNOWN&#039;}; 

and it seems to work while doing the manual checks. Since it&#039;s a distributed system when DMS sends the info to CMS it still errors out</description>
		<content:encoded><![CDATA[<p>any idea how to fix latency?<br />
I&#8217;ve removed</p>
<p># # Just in case of problems, let&#8217;s not hang Nagios<br />
# $SIG{&#8217;ALRM&#8217;} = sub {<br />
#         print &#8220;ERROR: No snmp response from $hostname (alarm timeout)\n&#8221;;<br />
#         exit $ERRORS{&#8217;UNKNOWN&#8217;}; </p>
<p>and it seems to work while doing the manual checks. Since it&#8217;s a distributed system when DMS sends the info to CMS it still errors out</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Francis</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-600</link>
		<dc:creator>Steve Francis</dc:creator>
		<pubDate>Thu, 28 May 2009 19:40:08 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-600</guid>
		<description>Hm. My RPN got truncated:
it should be (all one line):
dfLowTotalKBytes,0,LT,4294967296,dfLowTotalKBytes,+,dfLowTotalKBytes,IF,
dfHighTotalKBytes,4294967296,*,+,1024,*</description>
		<content:encoded><![CDATA[<p>Hm. My RPN got truncated:<br />
it should be (all one line):<br />
dfLowTotalKBytes,0,LT,4294967296,dfLowTotalKBytes,+,dfLowTotalKBytes,IF,<br />
dfHighTotalKBytes,4294967296,*,+,1024,*</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Francis</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-599</link>
		<dc:creator>Steve Francis</dc:creator>
		<pubDate>Thu, 28 May 2009 19:39:09 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-599</guid>
		<description>To be accurate, given that NetApp returns signed integers with unsigned content, you have to be careful, otherwise you end up with adding the raw dfLowAvailKBytes, instead of the sign corrected version, or multiplying the negative dfHighAvailKBytes, and ending up with crazy stuff.
In RPN form, what you want is:
dfLowTotalKBytes,0,LT,4294967296,dfLowTotalKBytes,+,dfLowTotalKBytes,IF,dfHighTotalKBytes,4294967296,*,+,1024,*

Other NetApp gotchas are that they change the units they report metrics in between releases. e.g. latency for reads and write is reported in milliseconds before 7.3, microseconds after. And should not even be collected on early 7.3 code, due to bugs.
This is some of the stuff that LogicMonitor automates and saves lots of time on.</description>
		<content:encoded><![CDATA[<p>To be accurate, given that NetApp returns signed integers with unsigned content, you have to be careful, otherwise you end up with adding the raw dfLowAvailKBytes, instead of the sign corrected version, or multiplying the negative dfHighAvailKBytes, and ending up with crazy stuff.<br />
In RPN form, what you want is:<br />
dfLowTotalKBytes,0,LT,4294967296,dfLowTotalKBytes,+,dfLowTotalKBytes,IF,dfHighTotalKBytes,4294967296,*,+,1024,*</p>
<p>Other NetApp gotchas are that they change the units they report metrics in between releases. e.g. latency for reads and write is reported in milliseconds before 7.3, microseconds after. And should not even be collected on early 7.3 code, due to bugs.<br />
This is some of the stuff that LogicMonitor automates and saves lots of time on.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scott Murphy</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-598</link>
		<dc:creator>Scott Murphy</dc:creator>
		<pubDate>Thu, 28 May 2009 19:18:54 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-598</guid>
		<description>slight modification to that, the high order value is the number of times to multiply by 2^32 and add the low order value to it.</description>
		<content:encoded><![CDATA[<p>slight modification to that, the high order value is the number of times to multiply by 2^32 and add the low order value to it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scott Murphy</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-597</link>
		<dc:creator>Scott Murphy</dc:creator>
		<pubDate>Thu, 28 May 2009 16:08:17 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-597</guid>
		<description>The problem is the SNMP v1 and v2c MIBs have 32 bit integers and v3 has 64 bit integers. You need DOT 7.3 to get use the v3 MIB. 

To get valid data between 2 and 4TB, you need to use unsigned integers. Larger volume/aggregate sizes are also supported in the v1 and v2c MIB but they are split into a low and high value, so you need to combine them. If you look through the MIB, you will see entries like:

	dfHighAvailKBytes
	dfLowAvailKBytes

in the dfTable section so you need to shift the first value left 32 bits and add them together.

I was hoping someone had already put this into the plugin but I guess not yet. I only started looking at this a couple of days ago when I got an insane response for a 6TB volume.</description>
		<content:encoded><![CDATA[<p>The problem is the SNMP v1 and v2c MIBs have 32 bit integers and v3 has 64 bit integers. You need DOT 7.3 to get use the v3 MIB. </p>
<p>To get valid data between 2 and 4TB, you need to use unsigned integers. Larger volume/aggregate sizes are also supported in the v1 and v2c MIB but they are split into a low and high value, so you need to combine them. If you look through the MIB, you will see entries like:</p>
<p>	dfHighAvailKBytes<br />
	dfLowAvailKBytes</p>
<p>in the dfTable section so you need to shift the first value left 32 bits and add them together.</p>
<p>I was hoping someone had already put this into the plugin but I guess not yet. I only started looking at this a couple of days ago when I got an insane response for a 6TB volume.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: morbid</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-590</link>
		<dc:creator>morbid</dc:creator>
		<pubDate>Sun, 19 Apr 2009 14:24:24 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-590</guid>
		<description>Latency and time outs

The systems that are &lt;10ms away checks are coming back just fine
Anything further away I&#039;m doing -t 150.
During manual check checks come back fine, but automatic checks still come back timed out. 
Any ideas?

Also volumes over 10TB come back with funky numbers, free space is OK, but the totals are incorrect.</description>
		<content:encoded><![CDATA[<p>Latency and time outs</p>
<p>The systems that are &lt;10ms away checks are coming back just fine<br />
Anything further away I&#8217;m doing -t 150.<br />
During manual check checks come back fine, but automatic checks still come back timed out.<br />
Any ideas?</p>
<p>Also volumes over 10TB come back with funky numbers, free space is OK, but the totals are incorrect.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris Wicklein</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-587</link>
		<dc:creator>Chris Wicklein</dc:creator>
		<pubDate>Wed, 01 Apr 2009 12:33:53 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-587</guid>
		<description>I don&#039;t think the use of abs to correct negative numbers is correct.  It looks like the problem is with unsigned 32-bit ints being misinterpreted as signed 32-bit ints.  An easy way to fix this with with pack/unpack:

$used = unpack(&quot;I&quot;, pack(&quot;i&quot;, $used));</description>
		<content:encoded><![CDATA[<p>I don&#8217;t think the use of abs to correct negative numbers is correct.  It looks like the problem is with unsigned 32-bit ints being misinterpreted as signed 32-bit ints.  An easy way to fix this with with pack/unpack:</p>
<p>$used = unpack(&#8221;I&#8221;, pack(&#8221;i&#8221;, $used));</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Francis</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-585</link>
		<dc:creator>Steve Francis</dc:creator>
		<pubDate>Thu, 26 Feb 2009 19:28:04 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-585</guid>
		<description>A quick commercial plug, so take with a grain of salt, but if you wish to solve your NetApp monitoring issues with no configuration or coding, take a look at www.logicmonitor.com. It provides complete performance and fault monitoring (including per volume latency and IO operations), it requires no configuration, and deals with volume instance renumbering.
Not free like Nagios, but requires no investment of time.</description>
		<content:encoded><![CDATA[<p>A quick commercial plug, so take with a grain of salt, but if you wish to solve your NetApp monitoring issues with no configuration or coding, take a look at <a href="http://www.logicmonitor.com" rel="nofollow">http://www.logicmonitor.com</a>. It provides complete performance and fault monitoring (including per volume latency and IO operations), it requires no configuration, and deals with volume instance renumbering.<br />
Not free like Nagios, but requires no investment of time.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: links for 2008-11-25 : JOSHMEANS.COM</title>
		<link>http://nerhood.wordpress.com/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-572</link>
		<dc:creator>links for 2008-11-25 : JOSHMEANS.COM</dc:creator>
		<pubDate>Tue, 25 Nov 2008 12:03:54 +0000</pubDate>
		<guid isPermaLink="false">http://nerhood.homeip.net/wordpress/archives/2006/06/19/monitoring-netapp-with-nagios-and-nagiosgraph/#comment-572</guid>
		<description>[...] Monitoring NetApp with Nagios and Nagiosgraph Potential hooks into GW? (tags: monitor netapp nagios rrd cacti) [...]</description>
		<content:encoded><![CDATA[<p>[...] Monitoring NetApp with Nagios and Nagiosgraph Potential hooks into GW? (tags: monitor netapp nagios rrd cacti) [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
