Vanished word units?

Message boards : Number crunching : Vanished word units?

To post messages, you must log in.

AuthorMessage
ramostol

Send message
Joined: 6 Feb 07
Posts: 64
Credit: 584,052
RAC: 0
Message 53160 - Posted: 19 May 2008, 8:28:01 UTC

For the record:

My Tasks for User Account insists that my computer has received the following wus:

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=149489497
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=149488655
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=149486989
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=149488150

The log file of my computer says no wus at all have been received in the period indicated (morning May 15th at 9:11 and 9:17 UTC). No such wus have been completed or are waiting on line, so they seem to have vanished somewhere.
ID: 53160 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Stwato

Send message
Joined: 11 Jan 06
Posts: 150
Credit: 655,634
RAC: 0
Message 53163 - Posted: 19 May 2008, 9:39:35 UTC

Somebody else reported some ghost WU's in the Cafe here.
I have had one as well. As Mod.Sense says, this is nothing to worry about.
ID: 53163 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 53180 - Posted: 19 May 2008, 22:19:56 UTC
Last modified: 19 May 2008, 22:20:27 UTC

This is interesting. The researcher who submitted these jobs has not received any results yet, but the status of the tasks show that they have been sent to clients.

There may be an issue here that we have to look into. These tasks have a large number of input files relative to other jobs.

Please post if you are having the same issues.
ID: 53180 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Helix Von Smelix

Send message
Joined: 16 Oct 05
Posts: 12
Credit: 4,026,102
RAC: 12
Message 53181 - Posted: 19 May 2008, 22:26:01 UTC
Last modified: 19 May 2008, 22:26:39 UTC

Hi,

I have been getting major download problems.

This from one box

19/05/2008 23:11:09|rosetta@home|Sending scheduler request: Requested by user. Requesting 352238 seconds of work, reporting 0 completed tasks
19/05/2008 23:11:14|rosetta@home|[error] Can't parse workunit in scheduler reply: unexpected XML tag or syntax
19/05/2008 23:11:14|rosetta@home|[error] No close tag in scheduler reply
19/05/2008 23:12:14|rosetta@home|Sending scheduler request: Requested by user. Requesting 352554 seconds of work, reporting 0 completed tasks
19/05/2008 23:12:19|rosetta@home|Scheduler request succeeded: got 0 new tasks
19/05/2008 23:12:19|rosetta@home|Message from server: Not sending work - last request too recent: 66 sec
19/05/2008 23:16:24|rosetta@home|Sending scheduler request: To fetch work. Requesting 353752 seconds of work, reporting 0 completed tasks
19/05/2008 23:16:34|rosetta@home|[error] Can't parse workunit in scheduler reply: unexpected XML tag or syntax
19/05/2008 23:16:34|rosetta@home|[error] No close tag in scheduler reply
19/05/2008 23:18:19|rosetta@home|Sending scheduler request: To fetch work. Requesting 354323 seconds of work, reporting 0 completed tasks
19/05/2008 23:18:39|rosetta@home|Scheduler request succeeded: got 0 new tasks
19/05/2008 23:18:39|rosetta@home|Message from server: Not sending work - last request too recent: 130 sec
19/05/2008 23:22:44|rosetta@home|Sending scheduler request: To fetch work. Requesting 355581 seconds of work, reporting 0 completed tasks
19/05/2008 23:22:54|rosetta@home|[error] Can't parse workunit in scheduler reply: unexpected XML tag or syntax
19/05/2008 23:22:54|rosetta@home|[error] No close tag in scheduler reply

Been going on for the last 5 hours that i know of.

Cheers
ID: 53181 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Snags

Send message
Joined: 22 Feb 07
Posts: 198
Credit: 2,828,576
RAC: 1,083
Message 53184 - Posted: 20 May 2008, 2:18:15 UTC

Just noticed this in my message log:

Mon May 19 19:20:32 2008|rosetta@home|Sending scheduler request: To fetch work. Requesting 16 seconds of work, reporting 0 completed tasks
Mon May 19 19:20:38 2008|rosetta@home|[error] Can't parse workunit in scheduler reply: unexpected XML tag or syntax
Mon May 19 19:20:38 2008|rosetta@home|[error] No close tag in scheduler reply

It appears to be this wu, BAK1b0o_loop_model_biased_it18_3335_466 ,which shows up on my tasks page but never made it to my computer - a ghost.

Snags
ID: 53184 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
AMD_is_logical

Send message
Joined: 20 Dec 05
Posts: 299
Credit: 31,460,681
RAC: 0
Message 53185 - Posted: 20 May 2008, 3:28:10 UTC

I just noticed that some of my machines have ghost WUs from around May 15.
For instance: https://boinc.bakerlab.org/rosetta/results.php?hostid=165747
has 4 WUs from May 15 that are ghosts.

The message log has something like the following for each ghost WU:

Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
Reason: To fetch work
Requesting 979 seconds of new work
Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
Can't parse work unit in scheduler reply: -112
No close tag in scheduler reply
Can't parse scheduler reply


This is from a Linux system with Boinc 5.2.13
ID: 53185 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 53188 - Posted: 20 May 2008, 4:21:43 UTC

Yep. DK is on to it. There was a problem with how the work units were created that caused the BOINC client to be unable to get off the ground with them. So, there's nothing to change or do on your end. And your client doesn't realize those work units were assigned to it, due to the XML errors, so there aren't any tasks to abort either.
Rosetta Moderator: Mod.Sense
ID: 53188 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 53191 - Posted: 20 May 2008, 4:54:49 UTC

Okay, I think I may have fixed the issue. I had to recompile our scheduling server programs with a larger xml_doc max buffer. A few batches were submitted that had so many input files that the scheduler reply from our server was truncated for these tasks. Please ignore the "ghost" work units. They will eventually get cleared.
ID: 53191 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote

Message boards : Number crunching : Vanished word units?



©2024 University of Washington
https://www.bakerlab.org