Report Maximum CPU Time Exceeded WU HERE

Message boards : Number crunching : Report Maximum CPU Time Exceeded WU HERE

To post messages, you must log in.

1 · 2 · 3 · Next

AuthorMessage
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1831
Credit: 119,627,225
RAC: 10,243
Message 9935 - Posted: 26 Jan 2006, 15:12:26 UTC

Just had one. What info do you want?

	Host	Project	Date	Message
dbserver	DBSERVER	rosetta@home	26/01/2006 14:16:20	Throughput 6124 bytes/sec
dbserver	DBSERVER	rosetta@home	26/01/2006 14:16:20	Finished upload of NO_SIM_ANNEAL_2tif_228_635_2_0
dbserver	DBSERVER	rosetta@home	26/01/2006 14:16:11	Started upload of NO_SIM_ANNEAL_2tif_228_635_2_0
dbserver	DBSERVER	rosetta@home	26/01/2006 14:16:09	Starting result PRODUCTION_ABINITIO_1vls__250_2364_0 using rosetta version 481
dbserver	DBSERVER	rosetta@home	26/01/2006 14:16:08	Computation for result NO_SIM_ANNEAL_2tif_228_635_2 finished
dbserver	DBSERVER	---	26/01/2006 14:16:08	request_reschedule_cpus: process exited
dbserver	DBSERVER	rosetta@home	26/01/2006 11:53:37	Starting result NO_SIM_ANNEAL_2tif_228_635_2 using rosetta version 481
dbserver	DBSERVER	rosetta@home	26/01/2006 11:53:37	Computation for result PRODUCTION_ABINITIO_2vik__250_1426_0 finished
dbserver	DBSERVER	---	26/01/2006 11:53:37	request_reschedule_cpus: process exited
dbserver	DBSERVER	rosetta@home	26/01/2006 11:53:36	Unrecoverable error for result PRODUCTION_ABINITIO_2vik__250_1426_0 (Maximum CPU time exceeded)
dbserver	DBSERVER	rosetta@home	26/01/2006 11:53:36	Aborting result PRODUCTION_ABINITIO_2vik__250_1426_0: exceeded CPU time limit 61808.106462
dbserver	DBSERVER	rosetta@home	25/01/2006 16:54:50	Throughput 5380 bytes/sec
dbserver	DBSERVER	rosetta@home	25/01/2006 16:54:50	Finished upload of PRODUCTION_ABINITIO_1vls__250_1175_0_0
dbserver	DBSERVER	rosetta@home	25/01/2006 16:54:19	Started upload of PRODUCTION_ABINITIO_1vls__250_1175_0_0
dbserver	DBSERVER	rosetta@home	25/01/2006 16:54:17	Starting result PRODUCTION_ABINITIO_2vik__250_1426_0 using rosetta version 481
dbserver	DBSERVER	rosetta@home	25/01/2006 16:54:16	Computation for result PRODUCTION_ABINITIO_1vls__250_1175_0 finished


Duron 1600, 512MB RAM, XP Pro (fully patched)
Benchmarks were:
	Host	Project	Date	Message
dbserver	DBSERVER	---	24/01/2006 18:12:09	   2404 integer MIPS (Dhrystone) per CPU
dbserver	DBSERVER	---	24/01/2006 18:12:09	   1456 double precision MIPS (Whetstone) per CPU


ID: 9935 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 9946 - Posted: 26 Jan 2006, 17:48:05 UTC
Last modified: 27 Jan 2006, 4:43:20 UTC

Please report "Maximum Time exceeded" errors here, by providing the following information;

The Work Unit name
The result Id of the WU - Can be found on the user statistics page
The CPU time spent on the Work Unit - This is also on the user statics page
Percent complete (if known) at the time the failure - Shown in the BOINC Manager window on your system
Your system ID - also available on the user statistics page

To save some typing time, you can post a link to the work unit result. The form of the command is as follows;

[url]WEB PAGE ADDRESS GOES HERE[/url]


How to locate the information we need - With thanks to user "DCDC" for asking.

[quote]I'm not sure how to find the info you're after. I've looked through loads of results listed under the computer that the result came from at:

Home > Your Account > Computers > [choose computer] > Results

and also through the statistics tab at the top, but all i'm going on is the name of the work unit in question. I've looked through all the results - errored or not - and can't see it.

Answer:

You probably already know most of what I am about to post here so please do not be insulted at my answer.

First are you sure your WU failed as a result of a Maximum time exceeded? In every case it would have to be the longest WU you have ever seen on your machine for CPU time. These errors ALWAYS occur at almost exactly the same number of CPU seconds on a particular machine. The actual number of seconds will be different in most cases for every individual machine. If you have WUs that have run longer than the one that errored and they completed successfully then the errors you are looking at are not "Maximum Time exceeded" errors.

In any case, you went to the right place to locate the information we need. It would defiantly be listed as a client error in your stats page for that machine. So you only need to look at the error WUs. If you click on the result link, about 2/3 of the way down the page it should show the type of error. That is where you will find the phrase "Maximum Time Exceeded". It will also show the error in the messages tab on your computer in the BOINC manager. In those messages it will give you the WU name but not the result name. All of this presumes that your system has actually reported the WU back to R@H.

You can tell if it has reported by looking in the work tab in the BOINC manager If the WU is not shown there it has reported, if it shows "Ready to Report" in the status column then it has not reported yet.

Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 9946 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Los Alcoholicos~La Muis

Send message
Joined: 4 Nov 05
Posts: 34
Credit: 1,041,724
RAC: 0
Message 9962 - Posted: 26 Jan 2006, 20:43:49 UTC
Last modified: 26 Jan 2006, 20:45:52 UTC

The score of the "Maximum cpu time exeeded" error:

Dual G5 2GHz 97986 - MAX CPU time +/- 12:30
MORE_FRAGS_1ogw_222_7591_0 no id (already removed from the host result page)
INCREASE_CYCLES_10_1ogw_226_6353_1 no id (already removed from the host result page)
MORE_FRAGS_1ogw_222_7024_1 no id (already removed from the host result page)
INCREASE_CYCLES_10_logw_226_9437_0 6243867
MORE_FRAGS_W_BARCODE_logw_231_6958_0 6243401
DEFAULT_logw_220_2312_1 5106086


Powerbook G4 1.25GHz 60954 MAX CPU time +/- 14:30
PRODUCTION_ABINITIO_1dhn__250_1858_2 8021660
PRODUCTION_ABINITIO_1c9oA__250_1041_2 8021240
INCREASE_CYCLES_10_1di2_226_7035_1 no id (already removed from the host result page)

MAX CPU time +/- 40:30
PRODUCTION_ABINITIO_1wit__250_514_0 7064862


G4 2GHz 100382
MAX CPU time +/- 26:00
PRODUCTION_ABINITIO_1c9oA__250_1041_2 7088268
PRODUCTION_ABINITIO_1c9oA__250_1041_2 7074002
INCREASE_CYCLES_10_1di2_226_6975_0 no id (already removed from the host result page)


G4 1.8GHz 101675
MAX CPU time +/- 22:30
BARCODE_FRAG_30_1hz6_234_2087_0 6590138
INCREASE_CYCLES_10_1di2_226_6975_0 no id (already removed from the host result page)


AMD XP 2,6 GHz 57518 MAX CPU time +/- 10:40
BARCODE_FRAG_30_2reb_234_9869_0 no id (already removed from the host result page)
ID: 9962 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
KwintenB

Send message
Joined: 24 Nov 05
Posts: 6
Credit: 183,329
RAC: 0
Message 9976 - Posted: 27 Jan 2006, 1:21:04 UTC

I already posted on the message board, but now i see it has al special place for max cpu time
I crunched 8 hours on that WU, everythings seems normal when i upload the WU but on the site i see it give a computation error. Will i get credits for this job. Or are thes 200 thrown-away points?
PRODUCTION_ABINITIO_1fkb__250_348
ID: 9976 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 9987 - Posted: 27 Jan 2006, 4:38:37 UTC
Last modified: 27 Jan 2006, 4:42:52 UTC

Please report "Maximum Time exceeded" errors here, by providing the following information;

The Work Unit name
The result Id of the WU - Can be found on the user statistics page
The CPU time spent on the Work Unit - This is also on the user statics page
Percent complete (if known) at the time the failure - Shown in the BOINC Manager window on your system
Your system ID - also available on the user statistics page

To save some typing time, you can post a link to the work unit result. The form of the command is as follows;

[url]WEB PAGE ADDRESS GOES HERE[/url]


How to locate the information we need - With thanks to user "DCDC" for asking.

[quote]I'm not sure how to find the info you're after. I've looked through loads of results listed under the computer that the result came from at:

Home > Your Account > Computers > [choose computer] > Results

and also through the statistics tab at the top, but all i'm going on is the name of the work unit in question. I've looked through all the results - errored or not - and can't see it.

Answer:

You probably already know most of what I am about to post here so please do not be insulted at my answer.

First are you sure your WU failed as a result of a Maximum time exceeded? In every case it would have to be the longest WU you have ever seen on your machine for CPU time. These errors ALWAYS occur at almost exactly the same number of CPU seconds on a particular machine. The actual number of seconds will be different in most cases for every individual machine. If you have WUs that have run longer than the one that errored and they completed successfully then the errors you are looking at are not "Maximum Time exceeded" errors.

In any case, you went to the right place to locate the information we need. It would defiantly be listed as a client error in your stats page for that machine. So you only need to look at the error WUs. If you click on the result link, about 2/3 of the way down the page it should show the type of error. That is where you will find the phrase "Maximum Time Exceeded". It will also show the error in the messages tab on your computer in the BOINC manager. In those messages it will give you the WU name but not the result name. All of this presumes that your system has actually reported the WU back to R@H.

You can tell if it has reported by looking in the work tab in the BOINC manager If the WU is not shown there it has reported, if it shows "Ready to Report" in the status column then it has not reported yet.

Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 9987 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Richard M
Avatar

Send message
Joined: 17 Sep 05
Posts: 13
Credit: 320,417
RAC: 0
Message 9999 - Posted: 27 Jan 2006, 7:47:55 UTC
Last modified: 27 Jan 2006, 7:49:11 UTC

ID: 9999 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile scsimodo

Send message
Joined: 17 Sep 05
Posts: 93
Credit: 946,359
RAC: 0
Message 10006 - Posted: 27 Jan 2006, 9:16:36 UTC

ID: 10006 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Dave Wilson

Send message
Joined: 8 Jan 06
Posts: 35
Credit: 379,049
RAC: 0
Message 10008 - Posted: 27 Jan 2006, 9:59:45 UTC

Here are mine,
All are from the only machine I have running Rosetta and it is a Powerbook (Lumbard) with a G4 500 upgrade card,

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=4928068
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=4928067
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=4928030
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=4928007
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=4928006

As you can see all times are within 1 sec. of each other.

6182438 - 4928068 - 8 Jan 2006 19:50:28 UTC 25 Jan 2006 19:10:41 UTC Over Client error Computing 72,088.09 - 140.56
6182437 - 4928067 - 8 Jan 2006 19:50:28 UTC 22 Jan 2006 23:15:01 UTC Over Client error Computing 72,087.20 - 140.56
6182399 - 4928030 - 8 Jan 2006 19:50:28 UTC 21 Jan 2006 05:38:12 UTC Over Client error Computing 72,087.83 - 140.56
6182376 - 4928007 - 8 Jan 2006 19:50:28 UTC 24 Jan 2006 22:46:18 UTC Over Client error Computing 72,087.35 - 140.56
6182375 - 4928006 - 8 Jan 2006 19:50:28 UTC 23 Jan 2006 20:34:42 UTC Over Client error Computing 72,087.19 - 140.56
ID: 10008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
[DPC] Frederik

Send message
Joined: 5 Nov 05
Posts: 1
Credit: 154,433
RAC: 0
Message 10009 - Posted: 27 Jan 2006, 10:11:10 UTC

had one running for 10 hours, 1%:
https://boinc.bakerlab.org/rosetta/result.php?resultid=6843024

and onother one that I ended trough the gui, was running 6 hours and still 1%:
https://boinc.bakerlab.org/rosetta/result.php?resultid=6856048
ID: 10009 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Los Alcoholicos~Megaflix

Send message
Joined: 10 Nov 05
Posts: 24
Credit: 77,199
RAC: 0
Message 10010 - Posted: 27 Jan 2006, 10:48:14 UTC
Last modified: 27 Jan 2006, 10:49:02 UTC

7257012 5796373 20 Jan 2006 10:48:10 UTC 26 Jan 2006 16:19:23 UTC Over Client error Done 45,491.09

https://boinc.bakerlab.org/rosetta/workunit.php?wuid=5796373
ID: 10010 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1831
Credit: 119,627,225
RAC: 10,243
Message 10011 - Posted: 27 Jan 2006, 11:26:08 UTC
Last modified: 27 Jan 2006, 11:27:35 UTC

OK, re my original post I've added the info requested ;) - you can remove my original post if it helps the housekeeping?

PRODUCTION_ABINITIO_2vik__250_1426_0
Computer ID: 52945
CPU Time: 61809.367506
% Complete unknown

HTH
Danny
ID: 10011 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Cureseekers~VortoN

Send message
Joined: 11 Nov 05
Posts: 3
Credit: 1,396,786
RAC: 0
Message 10053 - Posted: 27 Jan 2006, 19:49:27 UTC

Ok, some problems with PRODUCTION_ABINITIO WU's

The first couple:

Work unit name - PRODUCTION_ABINITIO_1rnbA_250_1194
The result Id - 7137340
The CPU time spent on the Work Unit - 8,979.84
Percent complete - not known
Your system ID - 64806
A link to the wu: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=5698305

Work unit name - PRODUCTION_ABINITIO_1louA_250_1180
The result Id - 7135590
The CPU time spent on the Work Unit - 27,758.20
Percent complete - not known
Your system ID - 64806
A link to the wu: https://boinc.bakerlab.org/rosetta/workunit.php?wuid=5696897

And a lot more to come when the rest of my PC's start flushing... Im in offline mode on all PC.
Total over 3000 credits worth of CPU time wasted because of these PRODUCTION_ABANITIO jobs... hope i at least get SOME credit/points for them...
ID: 10053 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
dcryor

Send message
Joined: 1 Dec 05
Posts: 1
Credit: 158,605
RAC: 0
Message 10055 - Posted: 27 Jan 2006, 20:29:01 UTC - in response to Message 9946.  


https://boinc.bakerlab.org/rosetta/results.php?hostid=83248

work unit 5427416
result ID 6812419
no CPU time shown on user page (!?)
it was about 90% complete

Error message:
1/27/2006 8:24:49 AM||Starting BOINC client version 5.2.13 for windows_intelx86
1/27/2006 8:24:49 AM||libcurl/7.14.0 OpenSSL/0.9.8 zlib/1.2.3
1/27/2006 8:24:49 AM||Data directory: C:Program FilesBOINC
1/27/2006 8:24:49 AM||Processor: 1 GenuineIntel Mobile Intel(R) Pentium(R) 4 - M CPU 1.80GHz
1/27/2006 8:24:49 AM||Memory: 511.43 MB physical, 1.93 GB virtual
1/27/2006 8:24:49 AM||Disk: 27.94 GB total, 20.50 GB free
1/27/2006 8:24:49 AM|rosetta@home|Computer ID: 83248; location: home; project prefs: default
1/27/2006 8:24:49 AM||General prefs: from rosetta@home (last modified 2006-01-23 17:12:57)
1/27/2006 8:24:49 AM||General prefs: no separate prefs for home; using your defaults
1/27/2006 8:24:50 AM||Remote control not allowed; using loopback address
1/27/2006 9:54:19 AM|rosetta@home|Deferring computation for result PRODUCTION_ABINITIO_1aiu__239_1890_0
1/27/2006 9:54:19 AM||Resuming computation and network activity
1/27/2006 9:54:19 AM||request_reschedule_cpus: Resuming activities
1/27/2006 9:54:22 AM|rosetta@home|Restarting result PRODUCTION_ABINITIO_1aiu__239_1890_0 using rosetta version 481
1/27/2006 2:22:53 PM||Suspending computation and network activity - user is active
1/27/2006 2:22:53 PM|rosetta@home|Pausing result PRODUCTION_ABINITIO_1aiu__239_1890_0 (left in memory)
1/27/2006 2:31:49 PM||Resuming computation and network activity
1/27/2006 2:31:49 PM||request_reschedule_cpus: Resuming activities
1/27/2006 2:31:49 PM|rosetta@home|Resuming result PRODUCTION_ABINITIO_1aiu__239_1890_0 using rosetta version 481
1/27/2006 2:33:35 PM|rosetta@home|Aborting result PRODUCTION_ABINITIO_1aiu__239_1890_0: exceeded CPU time limit 99000.250105
1/27/2006 2:33:35 PM|rosetta@home|Unrecoverable error for result PRODUCTION_ABINITIO_1aiu__239_1890_0 (Maximum CPU time exceeded)
1/27/2006 2:33:38 PM||request_reschedule_cpus: process exited
1/27/2006 2:33:38 PM|rosetta@home|Computation for result PRODUCTION_ABINITIO_1aiu__239_1890_0 finished
1/27/2006 2:34:37 PM|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
1/27/2006 2:34:37 PM|rosetta@home|Reason: To fetch work
1/27/2006 2:34:37 PM|rosetta@home|Requesting 8640 seconds of new work, and reporting 1 results
1/27/2006 2:34:42 PM|rosetta@home|Scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi succeeded
1/27/2006 2:34:44 PM|rosetta@home|Started download of 1mkyA.psipred_ss2.gz


Some of the message related to pausing and restarting deleted for brevity
ID: 10055 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
KwintenB

Send message
Joined: 24 Nov 05
Posts: 6
Credit: 183,329
RAC: 0
Message 10056 - Posted: 27 Jan 2006, 20:32:40 UTC - in response to Message 9976.  
Last modified: 27 Jan 2006, 20:34:33 UTC

I already posted on the message board, but now i see it has al special place for max cpu time
I crunched 8 hours on that WU, everythings seems normal when i upload the WU but on the site i see it give a computation error. Will i get credits for this job. Or are thes 200 thrown-away points?
PRODUCTION_ABINITIO_1fkb__250_348


Post this before the intervention of the moderator. My job was 100% completed, and yes i'm sure it was a Maximum CPU Time Exceeded.

Hope we'll get credits for these jobs.
Thank you in advance
ID: 10056 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Koen

Send message
Joined: 29 Sep 05
Posts: 8
Credit: 8,542,574
RAC: 0
Message 10059 - Posted: 27 Jan 2006, 20:58:31 UTC

Got one too:

WU 5705193

Last time I checked the WU was at 97,5%

K.
ID: 10059 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
XS_Duc
Avatar

Send message
Joined: 30 Dec 05
Posts: 17
Credit: 310,471
RAC: 0
Message 10076 - Posted: 28 Jan 2006, 0:35:49 UTC

Only one until now... hope it stays that way.

WU5722062
The weak shall perish...
ID: 10076 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile scsimodo

Send message
Joined: 17 Sep 05
Posts: 93
Credit: 946,359
RAC: 0
Message 10101 - Posted: 28 Jan 2006, 10:16:51 UTC

Next one for me:

https://boinc.bakerlab.org/rosetta/result.php?resultid=7329736

percent completed: somewhere around 90% (guessed)

Please shorten the WUs, I already crunchend nearly 50 Hours for nothing, thats about 600 Credits!!

ID: 10101 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
KwintenB

Send message
Joined: 24 Nov 05
Posts: 6
Credit: 183,329
RAC: 0
Message 10177 - Posted: 29 Jan 2006, 10:40:08 UTC

Got another one with the CPU Time Exceeded WU error
PRODUCTION_ABINITIO_1fkb__250_348_1
The remarkable thing is, that the wu crunched as long as last one reported with a CPU time Excession.
Togheter this is a lost of 425 credits :-(
ID: 10177 · Rating: 1 · rate: Rate + / Rate - Report as offensive    Reply Quote
XS_Duc
Avatar

Send message
Joined: 30 Dec 05
Posts: 17
Credit: 310,471
RAC: 0
Message 10199 - Posted: 29 Jan 2006, 17:35:08 UTC
Last modified: 29 Jan 2006, 17:44:58 UTC

Had another one today...

WU5584929
The weak shall perish...
ID: 10199 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Cureseekers~VortoN

Send message
Joined: 11 Nov 05
Posts: 3
Credit: 1,396,786
RAC: 0
Message 10221 - Posted: 30 Jan 2006, 15:51:23 UTC - in response to Message 10199.  

One failing WU of another system, more will follow in 10 hours or so.

PRODUCTION_ABINITIO_1ten__250_676_0
ID: 10221 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
1 · 2 · 3 · Next

Message boards : Number crunching : Report Maximum CPU Time Exceeded WU HERE



©2024 University of Washington
https://www.bakerlab.org