[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Testbed-admins] update need advice regarding "SNMP SET failed"



I have removed all the nodes using "withadminprivs /usr/testbed/sbin/deletenode pcx" and re-added them into the testbed. Because I want a clean startup for the experiment.

Now, all the nodes have matched control and experiment MACs with the ports on switches.
All the 'card2' are seted to '1', as below:
mysql> select * from wires;
+-------+-----+---------+----------+-------+-------+----------+-------+-------+
| cable | len | type    | node_id1 | card1 | port1 | node_id2 | card2 | port2 |
+-------+-----+---------+----------+-------+-------+----------+-------+-------+
|  NULL |   0 | Node    | pc1      |     1 |     1 | cisco2   |     1 |     4|
|  NULL |   0 | Node    | pc1      |     0 |     1 | cisco2   |     1 |     3|
|  NULL |   0 | Control | pc1      |     2 |     1 | cisco1   |     1 |     9|
|  NULL |   0 | Control | pc2      |     2 |     1 | cisco1   |     1 |    10|
|  NULL |   0 | Node    | pc2      |     1 |     1 | cisco2   |     1 |     6|
|  NULL |   0 | Node    | pc2      |     0 |     1 | cisco2   |     1 |     5|
|  NULL |   0 | Node    | pc3      |     2 |     1 | cisco2   |     1 |     8|
|  NULL |   0 | Node    | pc3      |     1 |     1 | cisco2   |     1 |     7|
|  NULL |   0 | Control | pc3      |     0 |     1 | cisco1   |     1 |     2|
+-------+-----+---------+----------+-------+-------+----------+-------+-------+

And as indicated as before, the snmp string are right and snmpset speed works fine.
> %snmpset -m all cisco2 vlan2 portAdminSpeed.1.1 = 100000000
> enterprises.cisco.workgroup.ciscoStackMIB.portGrp.portTable.portEntry.portAd
> minSpeed.1.1 = s100000000(100000000)

However, I still got the error message (the full error message is in the attachment):
Port not found, skipping
Port not found, skipping
*** ERROR: snmpit:
*** Can't "goto" into the middle of a foreach loop at
*** /usr/testbed/lib/snmpit_stack.pm line 465.
Does anyone have an idea about how to resolve this problem? Thanks

On Fri, Apr 17, 2009 at 3:14 PM, Cheng Cui <ccui1@tigers.lsu.edu> wrote:
Thanks Robert,

Quote from Robert:
>For now, let's just get the 'Node' wires right.

>All should have 'card2' set to '1', as you do have. Set 'port2' to
>whatever number the port is marked as having on the switch itself (eg.
>the number that's printed on the faceplate of the switch).

After I have done all these 'card2' setting to '1' and 'port2' matching with
the exp switch itself, including MAC verification, I still got this error
message:
*********************************
Port not found, skipping
Port not found, skipping
*** ERROR: snmpit:
***   Can't "goto" into the middle of a foreach loop at
***   /usr/testbed/lib/snmpit_stack.pm line 465.
reboot (pc): Attempting to reboot ...
reboot (pc3): Attempting to reboot ...
reboot (pc): Successful!
reboot (pc3): Successful!
reboot: Done. There were 0 failures.
*** ERROR: tbswap: Failed to set up VLANs.
*********************************

So is there any other possibility for the "port not found" error? Can I
delete the nodes using "withadminprivs /usr/testbed/sbin/deletenode pc1" and
re-add them in this situation?

Best Regards,

Cheng Cui
578-5445 . 231 Johnston Hall . Baton Rouge, LA 70803


-----Original Message-----
From: Robert P Ricci [mailto:ricci@cs.utah.edu]
Sent: Thursday, April 16, 2009 5:23 PM
To: Cheng Cui
Cc: 'Keith Sklower'; azad@cct.lsu.edu; testbed-admins@flux.utah.edu
Subject: Re: [Testbed-admins] update need advice regarding "SNMP SET failed"

Thus spake Cheng Cui on Wed, Apr 15, 2009 at 05:08:51PM -0500:
> What I get is:
> %snmpset -m all cisco2 vlan2 portAdminSpeed.1.1 = 100000000
>
enterprises.cisco.workgroup.ciscoStackMIB.portGrp.portTable.portEntry.portAd
> minSpeed.1.1 = s100000000(100000000)

Good, this indicates that the speed got changed, which means we can
concentrate on making sure the sires table is set up right.

> We got the data through web interface. I have to admit that boss could
> previously reach experiment switch before we altered the switch's ip
address
> by mistaken. At the very first, we see "snmp set" error; then, we
mistakenly
> altered the experiment switch's ip and get the "snmp get" error. After
Leigh
> and others point out our mistaken, I changed the experiment switch's ip in
> all the same subnet "192.168.0.x" for simplicity and get this "snmp set"
> error again. Honestly speaking, it was our mistaken to assume the snmp mib
> problem in a loop situation. I apologize for that. Now, it looks the wires
> table is the source of those problems because I assume there is a port
> mis-match in "switchmac", isn't it? : )

And let's go ahead and just get the wires table set up without worrying
about switchmac, since there are so few wires...

> mysql> select * from wires;
>
+-------+-----+---------+----------+-------+-------+----------+-------+-----
> --+
> | cable | len | type    | node_id1 | card1 | port1 | node_id2 | card2 |
> port2 |
>
+-------+-----+---------+----------+-------+-------+----------+-------+-----
> --+
> |  NULL |   0 | Node    | pc       |     1 |     1 | cisco2   |     1 |
> 9 |
> |  NULL |   0 | Node    | pc       |     0 |     1 | cisco2   |     1 |
> 1 |
> |  NULL |   0 | Control | pc       |     2 |     1 | cisco1   |     1 |
> 9 |
> |  NULL |   0 | Node    | pc2      |     1 |     1 | cisco2   |     1 |
> 5 |
> |  NULL |   0 | Node    | pc2      |     0 |     1 | cisco2   |     1 |
> 7 |
> |  NULL |   0 | Control | pc2      |     2 |     1 | cisco1   |     1 |
> 10 |
> |  NULL |   0 | Control | pc3      |     0 |     1 | cisco1   |     1 |
> 2 |
> |  NULL |   0 | Node    | pc3      |     2 |     1 | cisco2   |     1 |
> 19 |
> |  NULL |   0 | Node    | pc3      |     1 |     1 | cisco2   |     1 |
> 20 |
>
+-------+-----+---------+----------+-------+-------+----------+-------+-----
> --+

For now, let's just get the 'Node' wires right.

All should have 'card2' set to '1', as you do have. Set 'port2' to
whatever number the port is marked as having on the switch itself (eg.
the number that's printed on the faceplate of the switch).

--
/-----------------------------------------------------------
| Robert P Ricci <ricci@cs.utah.edu> | <ricci@flux.utah.edu>
| Research Associate, University of Utah Flux Group
| www.flux.utah.edu | www.emulab.net
\-----------------------------------------------------------




--
Best Regards,

Cheng Cui

578-5445 • 231 Johnston Hall • Baton Rouge, LA 70803
Running 'tbprerun  -e 43 test30.ns'
Beginning pre run for testbed/test30. 18:35:27:790584
Running parser ... 18:35:29:322035
Parser done! 18:35:33:939219
Precomputing visualization ...
Image rendering proceeding in background mode ...
Setting up static routes (if requested) ... 
Generating topomap ...
Verifying parse ...
Doing a pre-assign ...
assign_wrapper improved started
TIMESTAMP: 18:35:40:521142 assign_wrapper started
TIMESTAMP: 18:35:40:527102 TOP started
opened topfile
Minimum nodes   = 2
Maximum nodes   = 2
TIMESTAMP: 18:35:40:575096 TOP finished
Stopping after creating the TOP file, as directed.
Writing environment strings ...
Setting up additional program agent support ...
Writing program agent info ...
Pre run finished. 18:35:41:7779
Running 'tbswap in  testbed test30'
Beginning swap-in for testbed/test30 (43). 04/19/2009 18:35:41
TIMESTAMP: 18:35:41:992903 tbswap in started
Checking with Admission Control ...
Mapping to physical reality ...
TIMESTAMP: 18:35:42:11774 assign_wrapper started
assign_wrapper improved started
TIMESTAMP: 18:35:42:506623 assign_wrapper started
TIMESTAMP: 18:35:42:512548 TOP started
Resetting DB before updating.
opened topfile
Minimum nodes   = 2
Maximum nodes   = 2
TIMESTAMP: 18:35:42:576616 TOP finished
TIMESTAMP: 18:35:42:578048 assign_loop started
Assign Run 1
TIMESTAMP: 18:35:42:581398 ptopgen started
ptopargs -p testbed -e test30 
TIMESTAMP: 18:35:43:526303 ptopgen finished
TIMESTAMP: 18:35:43:528673 assign started
assign -P testbed-test30-5124.ptop testbed-test30-5124.top
   BEST SCORE:  2.34 in 17000 iters and 0.10344 seconds
TIMESTAMP: 18:35:45:542284 assign finished
TIMESTAMP: 18:35:45:544025 reserving started
TIMESTAMP: 18:35:46:387876 reserving finished
Successfully reserved all physical nodes we needed.
TIMESTAMP: 18:35:46:392693 assign_loop finished
TIMESTAMP: 18:35:46:395017 LoadPhysResources started
TIMESTAMP: 18:35:46:401208 LoadPhysResources finished
TIMESTAMP: 18:35:46:405899 interpreting started
TIMESTAMP: 18:35:46:408691 interpreting finished
TIMESTAMP: 18:35:46:410179 uploading started
TIMESTAMP: 18:35:48:156723 uploading finished
TIMESTAMP: 18:35:48:158582 assign_wrapper finished
TIMESTAMP: 18:35:48:573724 assign_wrapper finished
Mapped to physical reality!
Fetching tarballs and RPMs (if any) ...
TIMESTAMP: 18:35:48:580689 tarfiles_setup started
TIMESTAMP: 18:35:49:542620 tarfiles_setup finished
Setting up mountpoints.
TIMESTAMP: 18:35:49:547730 mountpoints started
TIMESTAMP: 18:35:54:100909 mountpoints finished
TIMESTAMP: 18:35:54:102921 named started
Setting up named maps.
TIMESTAMP: 18:35:55:30538 named finished
TIMESTAMP: 18:35:55:33387 gentopofile started
Generating ltmap (again) ...
TIMESTAMP: 18:35:55:949262 gentopofile finished
Resetting OS and rebooting.
TIMESTAMP: 18:35:55:952602 launching os_setup
Setting up VLANs.
TIMESTAMP: 18:35:55:960532 snmpit started
TIMESTAMP: 18:35:57:158153 os_setup started
TIMESTAMP: 18:35:57:174055 rebooting/reloading nodes started
Creating VLAN 60 as VLAN #3 on stack Experiment ... 
  Creating VLAN 60 as VLAN #3 on cisco2 ... Succeeded.
    Applying VLAN changes on cisco2 ... Succeeded
Port not found, skipping
Port not found, skipping
*** ERROR: snmpit:
***   Can't "goto" into the middle of a foreach loop at
***   /usr/testbed/lib/snmpit_stack.pm line 465. 
reboot (pc1): Attempting to reboot ...
reboot (pc3): Attempting to reboot ...
reboot (pc1): Successful!
reboot (pc3): Successful!
reboot: Done. There were 0 failures.
*** ERROR: tbswap: Failed to set up VLANs.
Cleaning up after errors.
Waiting for os_setup to finish
reboot (pc3): child returned 0 status.
reboot (pc1): child returned 0 status.
TIMESTAMP: 18:35:59:187774 rebooting/reloading finished
Waiting for local testbed nodes to finish rebooting ...
TIMESTAMP: 18:35:59:191439 Local node waiting started
Still waiting for pc3 - it's been 1 minute(s).
pc3 is alive and well
Still waiting for pc1 - it's been 1 minute(s).
pc1 is alive and well
TIMESTAMP: 18:37:28:291610 Local node waiting finished
OS Setup Done. 
TIMESTAMP: 18:37:28:309619 os_setup finished
Stopping the event system
TIMESTAMP: 18:37:30:660384 snmpit started
Removing VLANs.
  Removing VLAN #3 on cisco2 ... Succeeded.
    Applying VLAN changes on cisco2 ... Succeeded
TIMESTAMP: 18:37:31:916047 snmpit finished
Tearing down virtual nodes.
TIMESTAMP: 18:37:31:919432 vnode_setup -k started
vnode_setup running at parallelization: 10 wait_time: 120
Vnode teardown finished.
TIMESTAMP: 18:37:33:754742 vnode_setup finished
Freeing nodes.
TIMESTAMP: 18:37:33:758082 nfree started
Releasing all nodes from experiment [Experiment: testbed/test30].
Moving [Node: pc1] to [Experiment: emulab-ops/reloadpending]
Moving [Node: pc3] to [Experiment: emulab-ops/reloadpending]
TIMESTAMP: 18:37:35:431821 nfree finished
Resetting named maps.
TIMESTAMP: 18:37:35:436685 named started
TIMESTAMP: 18:37:36:328476 named finished
Resetting email lists.
TIMESTAMP: 18:37:36:332196 genelists started
TIMESTAMP: 18:37:37:206147 genelists finished
Resetting DB.
Failingly finished swap-in for testbed/test30. 18:37:37:224945
TIMESTAMP: 18:37:37:226401 tbswap in finished (failed)
*** ERROR: batchexp: tbswap in failed!
Cleaning up and exiting with status 1 ... 
Running 'tbend -f -e 43'
Beginning cleanup for testbed/test30. 18:37:38:73195
Clearing out virtual state.
Removing visualization data...
Cleanup finished! 18:37:39:734021
**** Experimental information, please ignore ****
Session ID = 11717
Likely Cause of the Problem:
  Failed to set up VLANs.
Cause: unknown
Confidence: 0.7
Script: tbswap
**** End experimental information ****
Removing experiment directories ...