I posted a similar article to this one about 12 hours ago over in the Security Discussions forum. Apologies to those of you who have already seen it. I'm not getting any replies there, though I thought that it would be the best forum to reach people who know about ssh. I'm casting my net wider.
I am running GROMACS 4.5.4, a package for molecular simulations. It is one of Ubuntu's supported packages. I have two nearly-identical computers, AMD six-core machines both running Ubuntu 11.10 x86_64.
GROMACS has a multiprocessing-aware version (also on Ubuntu's supported software list) which uses Open MPI to communicate between CPU's. I have been using the multi-CPU version of GROMACS on a single machine for a while now. I'm trying to harness the second machine as well over Ethernet. In other words, I'm trying to set up a little cluster.
According to the Open MPI documentation, I need a working ssh connection between my master machine and slave machines. So I have installed the openssh-server package on my slave machine. I followed the directions here to set up the public encryption key.
Once I got everything set up, I tried running GROMACS-mpi once, but it crashed. So I figured that I should just go back to basics, and simply test whether my ssh connection was working.
I can log in to the slave machine from the shell using ssh, but I see two problems. Actually, these may just be two manifestations of the same problem. Both involve random, LONG delays.
1) After I type a few commands, suddenly the cursor just freezes. It can do this while I'm in the middle of typing a command. Alternately, ssh can freeze in the middle of printing a lengthier output, say, the result of a "du" command. If I'm patient, the text that I typed right as the cursor froze will eventually appear -- but I might have to wait as long as three to five MINUTES.
2) I also receive frequent timeouts when I log off of ssh, and then try to log back in.
I'm doing this at home on my LAN. Both of my machines are sitting behind a firewall-enabled router. Local network traffic is absolutely quiet. All other functions on these two machines, including web browsing, etc. continue to function absolutely normally while these hangups occur. I have tried using ssh from the master computer, both while logged on locally to the slave computer and while not logged on. It doesn't seem to make a difference. None of the directories on either machine are encrypted, which I understand from this post can cause issues when trying to access a public key.
I suspect that one of these long delays probably also occurred when I tried to use GROMACS-mpi, and that it caused my crash.
Any suggestions? Thanks!
Bookmarks