GPUgrid - no communication on 2 of 3 PCs

Message boards : Projects : GPUgrid - no communication on 2 of 3 PCs
Message board moderation

To post messages, you must log in.

AuthorMessage
Magiceye04

Send message
Joined: 13 Aug 10
Posts: 24
Germany
Message 88877 - Posted: 17 Nov 2018, 12:35:13 UTC

Hello,

i have big trouble with 2 of my PCs to get connected to the GPUGRID Servers.
At the beginning (2 or 3 days ago) the connection was slow and sometimes ended in timeout. But reload did help.
Now GPUgrid ist completely dead. The Website is unreachable, the WUs wont get uploaded.

But a ping to www.gpugrid.net works, only about 10% packet loss.
On the 3rd PC everything is fine.

All 3 PCs are connected to the same router.
The not working PCs have Ubuntu16LTS and the working PC has Ubuntu 18LTS.

The rest of the internet is working also fine.

Any ideas why 2 of my PCs cant talk any more with GPUGRID?
Reboot didnt help.

One thing that might be one piece of the problem: i cloned the SSD of one PC somedays ago and used the clone in the other PC. But the PCs got different names and the IDs in the project are also different. Und at the beginning that was no problem. Both PCs got work and delivered the results. And other projects can communicate with their servers without problems.

Traceroute looks quite similar.
[spoiler]
traceroute of not communicating PC:
traceroute to www.gpugrid.net (84.89.134.145), 30 hops max, 60 byte packets
1 fritz.box (192.168.178.1) 0.448 ms 0.598 ms 1.278 ms
2 62.155.xxx.xxx (x) 19.643 ms 19.865 ms 21.690 ms
3 f-ed8-i.F.DE.NET.DTAG.DE (217.5.95.70) 27.551 ms 27.593 ms 27.604 ms
4 80.157.201.198 (80.157.201.198) 27.908 ms 35.696 ms 35.993 ms
5 be3187.ccr42.fra03.atlas.cogentco.com (130.117.1.118) 36.079 ms 36.515 ms 41.965 ms
6 be2799.ccr41.par01.atlas.cogentco.com (154.54.58.234) 51.110 ms 36.931 ms 42.366 ms
7 be3324.ccr52.bio02.atlas.cogentco.com (130.117.2.65) 49.509 ms be3325.ccr51.bio02.atlas.cogentco.com (130.117.48.205) 48.605 ms 48.388 ms
8 be3357.ccr31.mad05.atlas.cogentco.com (130.117.1.21) 58.225 ms be3358.ccr32.mad05.atlas.cogentco.com (130.117.1.97) 58.315 ms be3357.ccr31.mad05.atlas.cogentco.com (130.117.1.21) 58.335 ms
9 be3374.agr21.mad05.atlas.cogentco.com (130.117.2.62) 58.400 ms be3375.agr22.mad05.atlas.cogentco.com (130.117.50.202) 58.466 ms *
10 be3480.nr51.b015537-1.mad05.atlas.cogentco.com (154.25.1.18) 58.511 ms be3481.nr51.b015537-1.mad05.atlas.cogentco.com (154.25.1.110) 58.583 ms 65.188 ms
11 * 149.11.68.2 (149.11.68.2) 72.797 ms 149.11.68.50 (149.11.68.50) 72.670 ms
12 * 130.206.245.122 (130.206.245.122) 77.586 ms *
13 anella-val1-router.red.rediris.es (130.206.211.70) 74.310 ms 74.347 ms 74.362 ms
14 * * *
15 84.89.159.147 (84.89.159.147) 74.431 ms * 74.467 ms
16 * * *
17 * * *
18 grosso.upf.edu (84.89.134.145) 73.621 ms !X 68.978 ms !X *


working PC:
traceroute to www.gpugrid.net (84.89.134.145), 30 hops max, 60 byte packets
1 fritz.box (192.168.178.1) 0.406 ms 1.125 ms 1.317 ms
2 62.155.xxx.xxx (xx) 17.419 ms 18.208 ms 19.095 ms
3 f-ed8-i.F.DE.NET.DTAG.DE (217.5.95.70) 26.236 ms 26.304 ms 29.490 ms
4 80.157.201.198 (80.157.201.198) 32.720 ms 40.808 ms 40.877 ms
5 be3186.ccr41.fra03.atlas.cogentco.com (130.117.0.1) 40.976 ms 41.038 ms 41.071 ms
6 be2799.ccr41.par01.atlas.cogentco.com (154.54.58.234) 54.945 ms be2800.ccr42.par01.atlas.cogentco.com (154.54.58.238) 40.347 ms be2799.ccr41.par01.atlas.cogentco.com (154.54.58.234) 40.352 ms
7 be3324.ccr52.bio02.atlas.cogentco.com (130.117.2.65) 55.123 ms 52.034 ms *
8 be3357.ccr31.mad05.atlas.cogentco.com (130.117.1.21) 59.177 ms be3358.ccr32.mad05.atlas.cogentco.com (130.117.1.97) 57.064 ms 57.268 ms
9 be3379.agr22.mad05.atlas.cogentco.com (154.54.39.146) 55.661 ms 57.581 ms be3375.agr22.mad05.atlas.cogentco.com (130.117.50.202) 52.864 ms
10 be3481.nr51.b015537-1.mad05.atlas.cogentco.com (154.25.1.110) 58.658 ms be3480.nr51.b015537-1.mad05.atlas.cogentco.com (154.25.1.18) 53.014 ms 57.263 ms
11 149.11.68.2 (149.11.68.2) 57.465 ms 149.11.68.50 (149.11.68.50) 57.372 ms 57.493 ms
12 130.206.245.122 (130.206.245.122) 65.125 ms * 66.149 ms
13 anella-val1-router.red.rediris.es (130.206.211.70) 72.201 ms 72.461 ms 73.529 ms
14 * * *
15 84.89.159.147 (84.89.159.147) 86.897 ms 69.216 ms 69.280 ms
16 * * *
17 * * *
18 grosso.upf.edu (84.89.134.145) 73.386 ms !X 74.061 ms !X 74.559 ms !X
[/spoiler]

transfer log:
[spoiler]
Sa 17 Nov 2018 10:54:48 CET | GPUGRID | update requested by user
Sa 17 Nov 2018 10:55:04 CET | GPUGRID | Started upload of e29s4_e18s5p0f36-ADRIA_FOLDPUCB_NTL9_NoTica_KCenter_20_crystal_ss_contacts_20_ntl9_3-0-1-RND5283_1_0
Sa 17 Nov 2018 10:55:04 CET | GPUGRID | Started upload of e29s4_e18s5p0f36-ADRIA_FOLDPUCB_NTL9_NoTica_KCenter_20_crystal_ss_contacts_20_ntl9_3-0-1-RND5283_1_1
Sa 17 Nov 2018 10:55:05 CET | GPUGRID | [http] [ID#873] Info: Found bundle for host www.gpugrid.org: 0x55f8b685ff30 [serially]
Sa 17 Nov 2018 10:55:05 CET | GPUGRID | [http] [ID#872] Info: Trying 84.89.134.145...
Sa 17 Nov 2018 10:55:05 CET | GPUGRID | [http] [ID#873] Info: Hostname was found in DNS cache
Sa 17 Nov 2018 10:55:05 CET | GPUGRID | [http] [ID#873] Info: Trying 84.89.134.145...

Sa 17 Nov 2018 10:55:49 CET | | [http] [ID#0] Info: Connection timed out after 120116 milliseconds
Sa 17 Nov 2018 10:55:49 CET | | [http] [ID#0] Info: Closing connection 152
Sa 17 Nov 2018 10:55:49 CET | | [http] HTTP error: Timeout was reached
Sa 17 Nov 2018 10:55:49 CET | | [http] HTTP_OP::init_get(): http://www.gpugrid.net/notices.php?u...d165fb2f5bf6c7
Sa 17 Nov 2018 10:55:49 CET | | [http] [ID#0] Info: Found bundle for host www.gpugrid.net: 0x55f8b670f0b0 [serially]
Sa 17 Nov 2018 10:55:49 CET | | [http] [ID#0] Info: Trying 84.89.134.145...
Sa 17 Nov 2018 10:55:53 CET | GPUGRID | [http] [ID#1] Info: Connection timed out after 120124 milliseconds
Sa 17 Nov 2018 10:55:53 CET | GPUGRID | [http] [ID#1] Info: Closing connection 153
Sa 17 Nov 2018 10:55:53 CET | GPUGRID | [http] HTTP error: Timeout was reached
[/spoiler]

PS: i posted this problem already in the gpugrid forum - but the access to the forum is now also not possible and maybe its a general topic and has been seen on other projects also.
ID: 88877 · Report as offensive
Magiceye04

Send message
Joined: 13 Aug 10
Posts: 24
Germany
Message 88888 - Posted: 18 Nov 2018, 13:43:59 UTC

Problem was solved.
I put the SSD of one blocked PC in tha last working PC. There i could upload the data.
In the same time i updated/refreshed on the blocked PC #2 and magically there also the upload started.
In the end all WUs are uploaded.

Big underlined note for me: GPUgrid is not amused about cloning an SSD in another PC.
ID: 88888 · Report as offensive
Jim1348

Send message
Joined: 8 Nov 10
Posts: 310
United States
Message 88889 - Posted: 18 Nov 2018, 14:47:27 UTC - in response to Message 88877.  

PS: i posted this problem already in the gpugrid forum - but the access to the forum is now also not possible and maybe its a general topic and has been seen on other projects also.

I have intermittent problems with GPUGrid all the time. It seems to be worse in the U.S., but maybe happens sometimes in Europe also.
http://www.gpugrid.net/forum_thread.php?id=4806
ID: 88889 · Report as offensive

Message boards : Projects : GPUgrid - no communication on 2 of 3 PCs

Copyright © 2024 University of California.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation.