ewgates
November 6th, 2009, 05:51 PM
I am currently trying to run an instance on an Ubuntu 9.10 Server with Eucalyptus cloud. Before that, I was trying to run an instance on Ubuntu 9.04 with Eucalyptus cloud installed by hand with the same results. The instances show pending, sometimes for hours, before terminating. I have never seen a running state. I would appreciate any feedback at all.
The Ubuntu install disc was created on Monday morning 2Nov09.
During the installation, I followed the UEC/CDInstall document found at https://help.ubuntu.com/community/UEC/CDInstall (https://help.ubuntu.com/community/UEC/CDInstall) .
The setup is two Dell E6400 laptops, one as the Cloud and CC (hostname cloud1, IP=192.168.2.226), the other as the Node (hostname cloud3, IP=192.168.2.212).
VT selections in the BIOS of the laptops are set.
Eucalyptus on the CC can SSH into eucalylptus on the NC without passwords.
The image is the “Ubuntu 9.10 RC – Karmic Koala (amd64) Image Version 20091022” installed from the eucalyptus login, “store” menu. The only user is admin.
On my latest installation attempt, I ran “euca-run-instances emi-DEE31066 –k mykey –t m1.xlarge –n 1-4” , it remained pending then terminated without going into the running state. In the logs it is run on 05Nov09 at 3:47pm.
I also ran “euca-run-instances emi-DEE31066 –k mykey –t c1.medium –n 1-4” on this and previous installations this week, with the same results. The c1.medium run looked like it was creating two instances at 192.168.2.100 and 192.168.2.101.
During previous installs this week, using Ubuntu 9.10 Server with Eucalyptus cloud, I saw the same issues, but tried various solutions found online.
In this last install, I setup the partitions to include entire disk with LVM, but previous installs this week simply specified entire disk without LVM because I don’t think I need LVM.
During installation specified no email, because I plan to run all instances as admin.
One observation, when I run “euca-describe-availability-zones verbose”, I do see the cluster name with IP, and the instance sizes, but no nodes are listed. There are no nodes listed in the UEC/CDInstall example either but other documents state that the nodes should be listed.
From the logs, it does look like the CC and NC are communicating, and the NC is communicating with Walrus.
I am pretty confident that the keys and certifications match properly between the front end and node. One observation is that I see, in the cc.log file, the following ssh key:
(http://192.168.2.212:8775/axis2/services/EucalyptusNC) running instance: i-4A950821 emi-DEE31066 d0:0d:4A:95:08:21 d0:0d:4A:95:08:21 10 ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCA+qIIDTjfIWfOXe7aNN KdTGZKXisgRXZU2fFEyA3FIJqBbRNlrz2tF8mnBWvK2k4KgnQD Lrod989K+3CHHxwZRIJ8Zt3b8rg4z5MVMTdknSN3I7754BTqaQ ACWIq6ymADYEnFKBCi0QueAEdLELfMyMOLnvyyVD/hKVL8SEAAkrGBqNnmR3yfs8JutjsgjBzupanXwkEKm2SdHyZY6 6rh3kxBCwzJ/JGwoonpVXfVj4lsH4DNUQ+kAOjn3ZjhlJZA5mwnNiW7GyCTilk 0UPjFny0SQDI+GDTHxktgoSBrQl7tRgoLrSVySY42KUQ59Q+4w FhR/yUz4gLv/dG/RkqP admin@eucalyptus
But the nodes ~/eucalyptus/.ssh/authorized_keys shows an ssh-rsa ending with “EAo1E1mPRQ== eucalyptus@cloud1” which matches the front end’s .ssh/id_rsa.pub when logged in as eucalyptus@cloud1.
The nc.log shows the error “libvirt: Domain not found: no domain with matching name 'i-4A950821' (code=42)”
The node’s euca_test_nc.log shows no errors.
The command “virsh –c qemu:///system list” does show a connection on the node.
My first installation of ubuntu 9.10 server with eucalyptus cloud was an upgrade from ubuntu 9.04 desktop. One online user’s solution to a similar problem (defect #461186) was to install the cluster (front end) ubuntu 9.10 server on a laptop that had not been previously installed with Ubuntu. The current installation is a laptop that has only been installed with ubuntu 9.10 server with cloud. The node is an upgrade from 9.04 ubuntu desktop (I had attempted to install eucalyptus by hand) to 9.10 ubuntu server with cloud. I simply ran out of laptops.
The cc.log file shows the following errors/warnings grouped together:
“bad input params to vnetAttachTunnels()”
“failed to attach tunnels for vlan 10 during maintainNetworkState()”
“network state maintainance failed”
“in MANAGED-NOVLAN mode, priv interface 'eth0' must be a bridge, tunneling disabled”
The bridge setup in the node is the default when ubuntu 9.10 server with cloud was installed.
I can supply all or part of the nc.log and cc.log files.
The Ubuntu install disc was created on Monday morning 2Nov09.
During the installation, I followed the UEC/CDInstall document found at https://help.ubuntu.com/community/UEC/CDInstall (https://help.ubuntu.com/community/UEC/CDInstall) .
The setup is two Dell E6400 laptops, one as the Cloud and CC (hostname cloud1, IP=192.168.2.226), the other as the Node (hostname cloud3, IP=192.168.2.212).
VT selections in the BIOS of the laptops are set.
Eucalyptus on the CC can SSH into eucalylptus on the NC without passwords.
The image is the “Ubuntu 9.10 RC – Karmic Koala (amd64) Image Version 20091022” installed from the eucalyptus login, “store” menu. The only user is admin.
On my latest installation attempt, I ran “euca-run-instances emi-DEE31066 –k mykey –t m1.xlarge –n 1-4” , it remained pending then terminated without going into the running state. In the logs it is run on 05Nov09 at 3:47pm.
I also ran “euca-run-instances emi-DEE31066 –k mykey –t c1.medium –n 1-4” on this and previous installations this week, with the same results. The c1.medium run looked like it was creating two instances at 192.168.2.100 and 192.168.2.101.
During previous installs this week, using Ubuntu 9.10 Server with Eucalyptus cloud, I saw the same issues, but tried various solutions found online.
In this last install, I setup the partitions to include entire disk with LVM, but previous installs this week simply specified entire disk without LVM because I don’t think I need LVM.
During installation specified no email, because I plan to run all instances as admin.
One observation, when I run “euca-describe-availability-zones verbose”, I do see the cluster name with IP, and the instance sizes, but no nodes are listed. There are no nodes listed in the UEC/CDInstall example either but other documents state that the nodes should be listed.
From the logs, it does look like the CC and NC are communicating, and the NC is communicating with Walrus.
I am pretty confident that the keys and certifications match properly between the front end and node. One observation is that I see, in the cc.log file, the following ssh key:
(http://192.168.2.212:8775/axis2/services/EucalyptusNC) running instance: i-4A950821 emi-DEE31066 d0:0d:4A:95:08:21 d0:0d:4A:95:08:21 10 ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCA+qIIDTjfIWfOXe7aNN KdTGZKXisgRXZU2fFEyA3FIJqBbRNlrz2tF8mnBWvK2k4KgnQD Lrod989K+3CHHxwZRIJ8Zt3b8rg4z5MVMTdknSN3I7754BTqaQ ACWIq6ymADYEnFKBCi0QueAEdLELfMyMOLnvyyVD/hKVL8SEAAkrGBqNnmR3yfs8JutjsgjBzupanXwkEKm2SdHyZY6 6rh3kxBCwzJ/JGwoonpVXfVj4lsH4DNUQ+kAOjn3ZjhlJZA5mwnNiW7GyCTilk 0UPjFny0SQDI+GDTHxktgoSBrQl7tRgoLrSVySY42KUQ59Q+4w FhR/yUz4gLv/dG/RkqP admin@eucalyptus
But the nodes ~/eucalyptus/.ssh/authorized_keys shows an ssh-rsa ending with “EAo1E1mPRQ== eucalyptus@cloud1” which matches the front end’s .ssh/id_rsa.pub when logged in as eucalyptus@cloud1.
The nc.log shows the error “libvirt: Domain not found: no domain with matching name 'i-4A950821' (code=42)”
The node’s euca_test_nc.log shows no errors.
The command “virsh –c qemu:///system list” does show a connection on the node.
My first installation of ubuntu 9.10 server with eucalyptus cloud was an upgrade from ubuntu 9.04 desktop. One online user’s solution to a similar problem (defect #461186) was to install the cluster (front end) ubuntu 9.10 server on a laptop that had not been previously installed with Ubuntu. The current installation is a laptop that has only been installed with ubuntu 9.10 server with cloud. The node is an upgrade from 9.04 ubuntu desktop (I had attempted to install eucalyptus by hand) to 9.10 ubuntu server with cloud. I simply ran out of laptops.
The cc.log file shows the following errors/warnings grouped together:
“bad input params to vnetAttachTunnels()”
“failed to attach tunnels for vlan 10 during maintainNetworkState()”
“network state maintainance failed”
“in MANAGED-NOVLAN mode, priv interface 'eth0' must be a bridge, tunneling disabled”
The bridge setup in the node is the default when ubuntu 9.10 server with cloud was installed.
I can supply all or part of the nc.log and cc.log files.