WCG Communication Failure

WhiskyChris

Joined: 3 May 10
Posts: 7
United Kingdom
Message 32550 - Posted: 3 May 2010, 8:57:49 UTC

I'm running BOINC 6.10.17 on Ubuntu (recently upgraded to 10.04, but I think this problem was occurring on 9.10 as well). Basically, I can't communicate with WCG at all. I have two jobs ready to upload and can't download any more:

Mon 03 May 2010 09:50:23 BST	World Community Grid	Started upload of faah12319_ZINC00243751_xmdEq_1HSI_00_0_3
Mon 03 May 2010 09:50:23 BST	World Community Grid	Started upload of CMD2_0426-KIF11A.clustersOccur-2HJH_A.clustersOccur_20_153836_155688_1_0
Mon 03 May 2010 09:50:46 BST		Project communication failed: attempting access to reference site
Mon 03 May 2010 09:50:46 BST	World Community Grid	Temporarily failed upload of faah12319_ZINC00243751_xmdEq_1HSI_00_0_3: connect() failed
Mon 03 May 2010 09:50:46 BST	World Community Grid	Backing off 1 hr 45 min 4 sec on upload of faah12319_ZINC00243751_xmdEq_1HSI_00_0_3
Mon 03 May 2010 09:50:46 BST	World Community Grid	Temporarily failed upload of CMD2_0426-KIF11A.clustersOccur-2HJH_A.clustersOccur_20_153836_155688_1_0: connect() failed
Mon 03 May 2010 09:50:46 BST	World Community Grid	Backing off 2 hr 52 min 37 sec on upload of CMD2_0426-KIF11A.clustersOccur-2HJH_A.clustersOccur_20_153836_155688_1_0
Mon 03 May 2010 09:50:47 BST		Internet access OK - project servers may be temporarily down.


I can't even reach the WCG homepage in my web browser (Firefox 3.6.3). Any ideas what's wrong or how to fix it?
ID: 32550
WhiskyChris

Joined: 3 May 10
Posts: 7
United Kingdom
Message 32570 - Posted: 4 May 2010, 7:09:28 UTC - in response to Message 32551.  

I'm glad everything is working for you, but unfortunately not so here. You say that you had to add a line to cc_config.xml. Given that I upgraded from 9.10, I don't think I have downloaded any new WCG jobs, just finished ones that I had before, so perhaps this could be the problem? What did you have to add, bearing in mind this machine is 32-bit, not 64-bit like yours?

Also, as I mentioned, the problem seems to run a little deeper, as I can't access the WCG homepage (or forums) from here. Any ideas?
ID: 32570
Les Bayliss
Help desk expert

Joined: 25 Nov 05
Posts: 1654
Australia
Message 32572 - Posted: 4 May 2010, 7:56:25 UTC

I just used the link from "Choose projects" on the front page of this site, and WCG's front page loaded without delay, using Firefox, from Australia.

ID: 32572
WhiskyChris

Joined: 3 May 10
Posts: 7
United Kingdom
Message 32584 - Posted: 4 May 2010, 12:48:19 UTC - in response to Message 32571.  

Well, it definitely seems like the problem is at my end. I tried each of the links in your message and they all time out, and when pinging the IPs I don't get anything useful:

$ traceroute -p 443 198.20.2.246
traceroute to 198.20.2.246 (198.20.2.246), 30 hops max, 60 byte packets
 1  O2WirelessBox.lan (192.168.1.254)  93.504 ms  91.926 ms  90.199 ms
 2  * * *
 3  10.1.3.245 (10.1.3.245)  35.787 ms  39.233 ms  40.628 ms
 4  * * *
 5  10.1.2.157 (10.1.2.157)  51.483 ms  51.622 ms  51.747 ms
 6  * * *
 7  * * *
 8  * * *
 9  * * *
10  * * *
11  * * *
<snip>


That then continues with * * *

I've applied all available updates and rebooted both my computer and the wireless router. I'm suspicious that I've only noticed a problem with WCG and not with any other projects or websites. Could it be a problem with O2 broadband, or am I doing something wrong?
ID: 32584
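
The "connect() failed" lines in the BOINC log and the traceroute that stalls after hop 5 both suggest that the TCP connection itself is not being established, rather than a name-resolution problem. Note that Linux traceroute sends UDP probes by default, so -p 443 in the command above selects a UDP destination port and does not test HTTPS. A minimal sketch for probing the TCP port directly follows. It assumes bash and the coreutils timeout command are installed; tcp_check is a hypothetical helper name, not a BOINC or system tool.

```shell
# tcp_check HOST PORT [TIMEOUT_SECONDS]
# Hypothetical helper: returns 0 if a TCP connection to HOST:PORT opens
# within the timeout (default 5 s), non-zero otherwise.
# Relies on bash's /dev/tcp redirection, so bash is invoked explicitly.
tcp_check() {
    host=$1
    port=$2
    secs=${3:-5}
    timeout "$secs" bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$host" "$port" 2>/dev/null
}

# Example (address taken from the traceroute above; not run automatically):
#   tcp_check 198.20.2.246 443 && echo "port 443 reachable" || echo "port 443 not reachable"
#
# If the installed traceroute supports TCP probes (-T, needs root):
#   sudo traceroute -T -p 443 198.20.2.246
```

A failure here while other sites work would point at the route or a filter between the ISP and the server, not at the local machine.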
R_D_Metcalfe

Joined: 4 May 10
Posts: 1
United Kingdom
Message 32592 - Posted: 4 May 2010, 15:25:23 UTC

Hi

I'm getting the error messages below, but only on WCG; I can access climateprediction.net.

Also, I can't access the WCG website at all. Are there some problems? I've been trying all weekend.


04/05/2010 16:18:08|World Community Grid|Started upload of HFCC_s2_01956563_s2_0000_0_2
04/05/2010 16:18:30||Project communication failed: attempting access to reference site
04/05/2010 16:18:30|World Community Grid|Temporarily failed upload of HFCC_s2_01956563_s2_0000_0_2: connect() failed
04/05/2010 16:18:30|World Community Grid|Backing off 3 hr 54 min 14 sec on upload of HFCC_s2_01956563_s2_0000_0_2
04/05/2010 16:18:32||Internet access OK - project servers may be temporarily down.
ID: 32592
WhiskyChris

Joined: 3 May 10
Posts: 7
United Kingdom
Message 32594 - Posted: 4 May 2010, 15:45:02 UTC - in response to Message 32586.  

Perhaps the DNS servers are not the problem: I've just changed them to the Google ones by following these instructions, http://www.o2help.co.uk/router-change-dns/, but the traceroute is still the same as posted above, I still cannot access the website, and BOINC still can't reach the servers.
ID: 32594
Jord
Volunteer tester
Help desk expert

Joined: 29 Aug 05
Posts: 15480
Netherlands
Message 32600 - Posted: 4 May 2010, 20:37:56 UTC

Rob is referring to this story: DNSSEC Rollout May 5th 2010.
ID: 32600
Uplinger

Joined: 11 Dec 09
Posts: 4
United States
Message 32644 - Posted: 6 May 2010, 22:05:22 UTC

WhiskyChris,

Were you able to use the IP addresses that Sekerob posted about to access the site? For example, you can add these three lines to your /etc/hosts file on ubuntu.

sudo vi /etc/hosts

198.20.8.246 www.worldcommunitygrid.org
192.20.8.246 secure.worldcommunitygrid.org
192.20.8.241 grid.worldcommunitygrid.org

Thanks,
-Uplinger
ID: 32644
Uplinger

Joined: 11 Dec 09
Posts: 4
United States
Message 32661 - Posted: 7 May 2010, 16:10:39 UTC - in response to Message 32644.  

WhiskyChris,

Were you able to use the IP addresses that Sekerob posted about to access the site? For example, you can add these three lines to your /etc/hosts file on ubuntu.

sudo vi /etc/hosts

198.20.8.246 www.worldcommunitygrid.org
192.20.8.246 secure.worldcommunitygrid.org
192.20.8.241 grid.worldcommunitygrid.org

Thanks,
-Uplinger


DOH DOH DOH DOH DOH...it's supposed to be...

198.20.8.246 www.worldcommunitygrid.org
198.20.8.246 secure.worldcommunitygrid.org
198.20.8.241 grid.worldcommunitygrid.org

Sorry for the confusion.

-Uplinger
ID: 32661
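
A hosts entry only takes effect if the name is spelled exactly and the line is not commented out, so it can be worth checking what the file actually maps each name to. A minimal sketch, assuming a POSIX shell with awk; hosts_lookup is a hypothetical helper, not a standard command:

```shell
# hosts_lookup FILE NAME
# Hypothetical helper: prints every address that a hosts-format FILE maps
# to NAME (exact match), ignoring comments. Exits non-zero if NAME is absent.
hosts_lookup() {
    awk -v name="$2" '
        { sub(/#.*/, "") }                 # strip comments; fields are re-split
        NF >= 2 {
            for (i = 2; i <= NF; i++)      # field 1 is the address, the rest are names
                if ($i == name) { print $1; found = 1 }
        }
        END { exit !found }
    ' "$1"
}

# Example (not run automatically):
#   hosts_lookup /etc/hosts grid.worldcommunitygrid.org
```

With the corrected lines above in place, the example should print 198.20.8.241. To see what the system resolver returns, getent hosts grid.worldcommunitygrid.org can be compared against it. A hosts entry only bypasses the DNS lookup; it does not help if packets to that address are being dropped along the route.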
WhiskyChris

Joined: 3 May 10
Posts: 7
United Kingdom
Message 32682 - Posted: 8 May 2010, 16:36:40 UTC - in response to Message 32644.  

WhiskyChris,

Were you able to use the IP addresses that Sekerob posted about to access the site? For example, you can add these three lines to your /etc/hosts file on ubuntu.

sudo vi /etc/hosts

198.20.8.246 www.worldcommunitygrid.org
192.20.8.246 secure.worldcommunitygrid.org
192.20.8.241 grid.worldcommunitygrid.org

Thanks,
-Uplinger


I've just run traceroute on those new IPs. They go further than the original ones but I'm not sure if they go far enough? Both IPs go the same direction, this is what I get:

$ traceroute -p 443 198.20.8.241
traceroute to 198.20.8.241 (198.20.8.241), 30 hops max, 60 byte packets
 1  O2WirelessBox.lan (192.168.1.254)  48.801 ms  47.020 ms  45.342 ms
 2  * * *
 3  10.1.3.245 (10.1.3.245)  37.500 ms  41.114 ms  41.291 ms
 4  * * *
 5  xe-9-1-0.edge3.London1.Level3.net (212.113.15.65)  55.669 ms  55.727 ms  55.886 ms
 6  ae-34-52.ebr2.London1.Level3.net (4.69.139.97)  56.008 ms  29.319 ms  29.139 ms
 7  ae-44-44.ebr1.NewYork1.Level3.net (4.69.137.78)  98.744 ms ae-43-43.ebr1.NewYork1.Level3.net (4.69.137.74)  100.039 ms ae-41-41.ebr1.NewYork1.Level3.net (4.69.137.66)  100.649 ms
 8  ae-81-81.csw3.NewYork1.Level3.net (4.69.134.74)  108.366 ms  110.475 ms ae-91-91.csw4.NewYork1.Level3.net (4.69.134.78)  103.706 ms
 9  ae-74-74.ebr4.NewYork1.Level3.net (4.69.134.117)  115.604 ms ae-64-64.ebr4.NewYork1.Level3.net (4.69.134.113)  115.389 ms ae-74-74.ebr4.NewYork1.Level3.net (4.69.134.117)  107.253 ms
10  ae-5-5.car1.Montreal2.Level3.net (4.69.141.5)  299.340 ms  227.870 ms  227.969 ms
11  ae-11-11.car2.Montreal2.Level3.net (4.69.141.1)  108.662 ms  104.682 ms  107.080 ms
12  ae-2-2.car2.Toronto2.Level3.net (4.69.140.253)  112.554 ms  112.164 ms  115.611 ms
13  bx4-toronto12_xe-5-0-0_0.net.bell.ca (67.69.246.157)  118.586 ms  117.698 ms  114.904 ms
14  * * *
15  * * *
16  * * *


Followed by further "* * *"s.

I have also added the lines you suggest to /etc/hosts but I am still unable to access the website or forums.

$ cat /etc/hosts
127.0.0.1	localhost
127.0.1.1	highlandPark

# The following lines are desirable for IPv6 capable hosts
::1     localhost ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters
ff02::3 ip6-allhosts

# WCG Hosts
198.20.8.246 www.worldcommunitygrid.org
198.20.8.246 secure.worldcommunitygrid.org
198.20.8.241 grid.worldcommunitygrid.org

ID: 32682
Kevin Reed

Joined: 16 Jan 08
Posts: 5
United States
Message 32706 - Posted: 10 May 2010, 15:00:14 UTC - in response to Message 32682.  

Chris,

Can you go ahead and send an email to support@worldcommunitygrid.org and ask for Kevin Reed? In that email, can you include your public IP address? I need to search our web server log files to see whether your requests are even reaching our servers.

thanks,
Kevin
ID: 32706
WhiskyChris

Joined: 3 May 10
Posts: 7
United Kingdom
Message 32739 - Posted: 12 May 2010, 8:07:29 UTC - in response to Message 32706.  

I found a temporary resolution which may end up having fixed it. I connected to the Edinburgh University VPN, and from there I managed to reach the WCG website, and BOINC managed to upload finished jobs and download new ones.

Furthermore, having now disconnected from the VPN, I can still access the website and BOINC still seems to be communicating happily. Strange, but good? I'll update you if it stops working again.

Thanks for all your ideas,
Chris
ID: 32739
Les Bayliss
Help desk expert

Joined: 25 Nov 05
Posts: 1654
Australia
Message 32740 - Posted: 12 May 2010, 8:47:12 UTC - in response to Message 32739.  

... boinc still seems to be communicating ...

Is that without re-booting?

ID: 32740
Kevin Reed

Joined: 16 Jan 08
Posts: 5
United States
Message 32860 - Posted: 18 May 2010, 14:33:38 UTC - in response to Message 32739.  

I found a temporary resolution which may end up having fixed it. I connected to the Edinburgh University VPN, and from there I managed to reach the WCG website, and BOINC managed to upload finished jobs and download new ones.

Furthermore, having now disconnected from the VPN, I can still access the website and BOINC still seems to be communicating happily. Strange, but good? I'll update you if it stops working again.

Thanks for all your ideas,
Chris



Chris,

Can you attempt to access the website without going through your VPN? We engaged the support team for the bell.ca network, and one user we were working with has reported success getting through in the past 12 hours. I want to confirm that your problem has been resolved as well. If you are still having issues, can you provide a tracert with and without your VPN turned on?

Also, if you are still offline, please send an email to support@worldcommunitygrid.org, because we would really like to get to the bottom of this issue.

thanks,
Kevin
ID: 32860
WhiskyChris

Joined: 3 May 10
Posts: 7
United Kingdom
Message 32866 - Posted: 18 May 2010, 23:02:39 UTC - in response to Message 32860.  

To confirm, the situation now seems to be completely resolved. BOINC is communicating fine, and I can access the website with no difficulties. Thanks again for all your help.
ID: 32866
KeithSloan

Joined: 31 May 10
Posts: 5
United Kingdom
Message 33164 - Posted: 31 May 2010, 19:04:10 UTC - in response to Message 32866.  

I am having problems setting up BOINC since moving to Ubuntu 10.04. I can access the sites via Firefox, but not when I try to access the application manager from a BOINC startup.

I had to change some Firefox settings to get it to work.

I also had to disable IPv6 in Thunderbird.

Also, I think I managed to turn off IPv6; at least it says "disabled" when I check the Ethernet details.
ID: 33164
KeithSloan

Joined: 31 May 10
Posts: 5
United Kingdom
Message 33169 - Posted: 31 May 2010, 21:52:53 UTC - in response to Message 33164.  

Found some stuff related to IPv6 in /etc/hosts. Took a backup and then deleted the IPv6 stuff. Seems to have solved the problem.
ID: 33169
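
Deleting the IPv6 entries worked here, though the thread does not establish why. A reversible alternative is to comment the lines out instead of removing them. A minimal sketch, assuming a POSIX shell with awk; comment_ipv6 is a hypothetical helper, and treating any first field that contains a colon as an IPv6 address is a simplifying assumption:

```shell
# comment_ipv6 FILE
# Hypothetical helper: prints FILE with every active IPv6 entry (first field
# contains ":" and is not already a comment) prefixed with "# ".
# Output goes to stdout only; FILE itself is not modified.
comment_ipv6() {
    awk '$1 ~ /:/ && $1 !~ /^#/ { print "# " $0; next } { print }' "$1"
}

# Example (not run automatically; keeps a backup, then rewrites the file):
#   sudo cp /etc/hosts /etc/hosts.bak
#   comment_ipv6 /etc/hosts.bak | sudo tee /etc/hosts >/dev/null
```

Reading from the backup and writing to /etc/hosts avoids truncating the file while it is still being read, and restoring is a matter of copying the backup over it again.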


Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.