Message boards : Number crunching : No GPUGRID Tasks Running

Reysic
Joined: 3 Apr 11
Posts: 11
Credit: 2,052,552
RAC: 0
Level
Ala
Message 24552 - Posted: 23 Apr 2012 | 9:21:12 UTC

I've been running a number of BOINC projects including GPUGRID all with a resource share of 100. I recently noticed that none of my hosts have completed a GPUGRID task in over a week. Furthermore, I discovered by checking my BOINC event log on a host that my hosts aren't even requesting GPUGRID tasks, much less running them. I decided I'd like to prioritize GPUGRID work over everything else, so I set GPUGRID's resource share to 100 and the resource share of all other projects to 0. However, even after forcing the clients to update the shares, they are still requesting and running tasks from other projects and not GPUGRID. Is this some sort of carryover effect from when the resource shares for all projects were equal and BOINC is trying to ensure that all the projects have done an equivalent amount of work before moving on to the updated resource shares? If so, is there any way to expedite the process and get BOINC prioritizing GPUGRID tasks now?

Thanks in advance for any replies.

MarkJ
Volunteer moderator
Volunteer tester
Joined: 24 Dec 08
Posts: 738
Credit: 200,909,904
RAC: 0
Level
Leu
Message 24554 - Posted: 23 Apr 2012 | 11:49:44 UTC - in response to Message 24552.

Your computers are hidden, so we can't see any details that would help.

The first thing is to check that BOINC thinks you have a compatible graphics card (according to the BOINC startup messages). Make sure you have the "use GPU always" option selected, and make sure you aren't running one of the recent buggy NVIDIA drivers (i.e. 295.x or 296.x under Windows).
____________
BOINC blog

dskagcommunity
Joined: 28 Apr 11
Posts: 456
Credit: 817,865,789
RAC: 0
Level
Glu
Message 24555 - Posted: 23 Apr 2012 | 12:37:23 UTC

That's not a problem with GPUGRID alone. I'm interested in a solution too, because I can't define any backup GPU projects on any of my GPU machines (doesn't really matter). POEM, for example, is stronger than MW.
____________
DSKAG Austria Research Team: http://www.research.dskag.at



Reysic
Joined: 3 Apr 11
Posts: 11
Credit: 2,052,552
RAC: 0
Level
Ala
Message 24556 - Posted: 23 Apr 2012 | 12:37:36 UTC - in response to Message 24554.

Thanks for your reply, MarkJ. I checked everything you listed and it all looks good so far.

Reysic
Joined: 3 Apr 11
Posts: 11
Credit: 2,052,552
RAC: 0
Level
Ala
Message 24565 - Posted: 23 Apr 2012 | 21:19:08 UTC

It looks like one of my hosts did finally download a GPUGRID task, but it failed shortly after starting computation with the error I've posted below. Anyone know what the error message means?

<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
The system cannot find the path specified. (0x3) - exit code 3 (0x3)
</message>
<stderr_txt>
# Using device 0
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 460"
# Clock rate: 1.80 GHz
# Total amount of global memory: 805306368 bytes
# Number of multiprocessors: 7
# Number of cores: 56
# Device 1: "GeForce GTX 460"
# Clock rate: 1.44 GHz
# Total amount of global memory: 805306368 bytes
# Number of multiprocessors: 7
# Number of cores: 56
MDIO: cannot open file "restart.coor"
SWAN: FATAL : swanMemcpyDtoH failed

Assertion failed: 0, file swanlib_nv.c, line 390

This application has requested the Runtime to terminate it in an unusual way.
Please contact the application's support team for more information.

</stderr_txt>
]]>

Damaraland
Joined: 7 Nov 09
Posts: 152
Credit: 16,181,924
RAC: 0
Level
Pro
Message 24617 - Posted: 28 Apr 2012 | 16:06:16 UTC - in response to Message 24565.

It would be useful if you gave more info about your system: driver version, OS, and so on.
A link to one of the tasks would help too.

5pot
Joined: 8 Mar 12
Posts: 411
Credit: 2,083,882,218
RAC: 0
Level
Phe
Message 24618 - Posted: 28 Apr 2012 | 16:17:18 UTC

Off the top of my head, I can also see that you have SLI enabled and that one of your cards is overclocked much higher than the other.
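
The clock difference is visible in the stderr output posted in Message 24565: device 0 reports 1.80 GHz and device 1 reports 1.44 GHz, although both are GeForce GTX 460 cards. As a rough sketch only, assuming the stderr keeps the "# Device N" / "# Clock rate" layout shown in that post, a few lines of Python can pull those values out and flag a mismatch. The names `device_clocks` and `clocks_mismatched` and the 0.05 GHz tolerance are made up for illustration; they are not part of BOINC or the GPUGRID application.

```python
import re

# Excerpt of the stderr text from the failed task (copied from the post above).
STDERR = """\
# Using device 0
# There are 2 devices supporting CUDA
# Device 0: "GeForce GTX 460"
# Clock rate: 1.80 GHz
# Total amount of global memory: 805306368 bytes
# Number of multiprocessors: 7
# Number of cores: 56
# Device 1: "GeForce GTX 460"
# Clock rate: 1.44 GHz
# Total amount of global memory: 805306368 bytes
# Number of multiprocessors: 7
# Number of cores: 56
"""

DEVICE_RE = re.compile(r'#\s*Device (\d+): "(.*)"')
CLOCK_RE = re.compile(r'#\s*Clock rate:\s*([\d.]+)\s*GHz')


def device_clocks(stderr_text):
    """Map device index -> (name, clock in GHz), assuming the layout above."""
    clocks = {}
    current = None  # (index, name) of the device whose clock line we expect next
    for line in stderr_text.splitlines():
        m = DEVICE_RE.match(line)
        if m:
            current = (int(m.group(1)), m.group(2))
            continue
        m = CLOCK_RE.match(line)
        if m and current is not None:
            clocks[current[0]] = (current[1], float(m.group(1)))
            current = None
    return clocks


def clocks_mismatched(clocks, tolerance_ghz=0.05):
    """True if the reported clocks differ by more than the (arbitrary) tolerance."""
    rates = [rate for _, rate in clocks.values()]
    return bool(rates) and max(rates) - min(rates) > tolerance_ghz


clocks = device_clocks(STDERR)
```

Calling `clocks_mismatched(clocks)` on this excerpt flags the two cards as running at different clocks. That does not prove the overclock caused the failure, but it is a quick way to spot the difference when comparing task logs.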
