Problems with Rosetta version 5.43

Message boards : Number crunching : Problems with Rosetta version 5.43

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · Next

AuthorMessage
Profile Trog Dog
Avatar

Send message
Joined: 25 Nov 05
Posts: 129
Credit: 57,345
RAC: 0
Message 33817 - Posted: 31 Dec 2006, 9:17:55 UTC

Here's another one. So far the common link is 5.7.5 - I'll revert these boxes back to 5.4.11 and see if the errors follow.
ID: 33817 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile BrotherBard

Send message
Joined: 8 Oct 05
Posts: 3
Credit: 3,893,641
RAC: 0
Message 33832 - Posted: 31 Dec 2006, 14:20:06 UTC
Last modified: 31 Dec 2006, 14:38:37 UTC

I had an error on this WU on this host

Sun Dec 31 07:07:07 2006|rosetta@home|Unrecoverable error for result FRA_s011_STRUCTURAL_GENOMICS_hom001_4_s011_4_1ys7A_IGNORE_THE_REST_256_1471_4_0 (process exited with code 131 (0x83))

--Nathan

[edit: no active graphics or screensaver, BOINC Manager 5.4.9]
ID: 33832 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kashyyk

Send message
Joined: 1 Aug 06
Posts: 2
Credit: 798,780
RAC: 0
Message 33842 - Posted: 31 Dec 2006, 16:42:30 UTC - in response to Message 33814.  

It should be fine if you just run the normal linux client.


Well, it's not that easy to install a 32-bit BOINC client on a 64-bit Gentoo system. Though I could use binaries, they wouldn't be integrated into the system (init-script).
There should be an easier method (64-bit-client), which many other projects do provide.
ID: 33842 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Pat the Red

Send message
Joined: 29 Aug 06
Posts: 1
Credit: 10,117,220
RAC: 0
Message 33849 - Posted: 31 Dec 2006, 21:52:10 UTC - in response to Message 33817.  
Last modified: 31 Dec 2006, 21:58:07 UTC

Here's another one. So far the common link is 5.7.5 - I'll revert these boxes back to 5.4.11 and see if the errors follow.

I get the same error in 5.4.11. Rolling back farther may work. I finally caught Rosetta crashed in a recoverable position, usually it hangs my tower up and its the reset button to recover. I can't wait for the new year and the updated Rosetta. I'm on an Atlon 64x2 3800+ and Win XP Pro OS

I saw something ealier about more hydrogen atoms than Rosetta was designed for. Make an absurdly high limit please. If your limit is the 32bit coding, then creativity is called for.
ID: 33849 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
FluffyChicken
Avatar

Send message
Joined: 1 Nov 05
Posts: 1260
Credit: 369,635
RAC: 0
Message 33866 - Posted: 1 Jan 2007, 9:33:40 UTC - in response to Message 33842.  

It should be fine if you just run the normal linux client.


Well, it's not that easy to install a 32-bit BOINC client on a 64-bit Gentoo system. Though I could use binaries, they wouldn't be integrated into the system (init-script).
There should be an easier method (64-bit-client), which many other projects do provide.


If you do a quick search you 'll see the standard work around if you insist on still running the 64bit client and want to run rosetta@home, you'll just need to keep up to date on the client versions when they update so you don't missmatch work and client versions.

Else offer to help them develop and maintain a 64bit Gentoo client ;-)
Team mauisun.org
ID: 33866 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bossone

Send message
Joined: 6 Jan 06
Posts: 3
Credit: 37,125
RAC: 0
Message 33924 - Posted: 2 Jan 2007, 17:25:24 UTC


ID: 33924 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Splatshot

Send message
Joined: 21 Dec 06
Posts: 1
Credit: 66,739
RAC: 0
Message 33957 - Posted: 2 Jan 2007, 21:29:00 UTC - in response to Message 33653.  

I also received the following error that this user also saw:

1/2/2007 1:43:21 PM|rosetta@home|Unrecoverable error for result s023__BOINC_ABRELAX_NEWRELAXFLAGS_hom001__1456_11149_1 ( - exit code 1073807364 (0x40010004))

Except I received this error after the following:

I selected the task in the task tab in Boinc Manager. I clicked on the "Show Graphics" button. After watching it for a bit, I closed the window. That is when this error happened for me. I suspect there needs to be a way to close the graphics without killing the number crunching, at least in my case.

------

Rosetta v5.43 is regularly "hanging" on me. I'm running the Windows version w/ BOINC screen saver set. Often when I return to my machine I have to kill the rosetta process to get to my desktop.

This isn't new to 5.43. I have another machine running rosetta that does not have this problem but I also use another screen saver on it. This machine is a dual 2.8GHz machine w/ 1GB RAM.



Here's my latest log - the failures are because I killed it twice today.

12/28/2006 2:11:51 PM|rosetta@home|Resuming task s019__BOINC_ABRELAX_NEWRELAXFLAGS_hom010__1458_12966_0 using rosetta version 543
12/28/2006 2:11:51 PM|World Community Grid|Resuming task faah1117_d138n601_x1MEU_00_1 using faah version 528
12/28/2006 2:13:29 PM||Suspending computation - user is active
12/28/2006 2:13:29 PM|rosetta@home|Pausing task s019__BOINC_ABRELAX_NEWRELAXFLAGS_hom010__1458_12966_0 (left in memory)
12/28/2006 2:13:29 PM|World Community Grid|Pausing task faah1117_d138n601_x1MEU_00_1 (left in memory)
12/28/2006 2:13:29 PM||Suspending network activity - user is active
12/28/2006 2:14:01 PM|rosetta@home|Unrecoverable error for result s019__BOINC_ABRELAX_NEWRELAXFLAGS_hom010__1458_12966_0 ( - exit code 1073807364 (0x40010004))
12/28/2006 2:14:01 PM|rosetta@home|Deferring scheduler requests for 1 minutes and 0 seconds
12/28/2006 2:14:01 PM||Rescheduling CPU: application exited
12/28/2006 2:14:01 PM|rosetta@home|Computation for task s019__BOINC_ABRELAX_NEWRELAXFLAGS_hom010__1458_12966_0 finished
12/28/2006 2:23:51 PM||Resuming computation
12/28/2006 2:23:51 PM||Rescheduling CPU: Resuming computation
12/28/2006 2:23:51 PM||Resuming network activity
12/28/2006 2:23:51 PM|Einstein@Home|Started upload of file h1_0343.5_S5R1__5397_S5R1a_0_0
12/28/2006 2:23:51 PM|World Community Grid|Resuming task faah1117_d138n601_x1MEU_00_1 using faah version 528
12/28/2006 2:23:51 PM|rosetta@home|Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
12/28/2006 2:23:51 PM|rosetta@home|Reason: To fetch work
12/28/2006 2:23:51 PM|rosetta@home|Requesting 17280 seconds of new work, and reporting 1 completed tasks
12/28/2006 2:23:52 PM|Einstein@Home|Temporarily failed upload of h1_0343.5_S5R1__5397_S5R1a_0_0: HTTP file not found
12/28/2006 2:23:52 PM|Einstein@Home|Backing off 1 hours, 24 minutes and 57 seconds on upload of file h1_0343.5_S5R1__5397_S5R1a_0_0
12/28/2006 2:23:56 PM|rosetta@home|Scheduler request succeeded
12/28/2006 2:23:58 PM|rosetta@home|Started download of file hom003_s026_.fasta.gz
12/28/2006 2:23:58 PM|rosetta@home|Started download of file hom003_s026_.psipred_ss2.gz
12/28/2006 2:24:00 PM|rosetta@home|Finished download of file hom003_s026_.fasta.gz
12/28/2006 2:24:00 PM|rosetta@home|Throughput 2576 bytes/sec
12/28/2006 2:24:00 PM|rosetta@home|Finished download of file hom003_s026_.psipred_ss2.gz
12/28/2006 2:24:00 PM|rosetta@home|Throughput 14017 bytes/sec
12/28/2006 2:24:00 PM|rosetta@home|Started download of file boinc_hom003_aas026_03_05.200_v1_3.gz
12/28/2006 2:24:00 PM|rosetta@home|Started download of file boinc_hom003_aas026_09_05.200_v1_3.gz
12/28/2006 2:24:04 PM|rosetta@home|Finished download of file boinc_hom003_aas026_03_05.200_v1_3.gz
12/28/2006 2:24:04 PM|rosetta@home|Throughput 500152 bytes/sec
12/28/2006 2:24:04 PM|rosetta@home|Finished download of file boinc_hom003_aas026_09_05.200_v1_3.gz
12/28/2006 2:24:04 PM|rosetta@home|Throughput 730755 bytes/sec
12/28/2006 2:24:05 PM||Rescheduling CPU: files downloaded
12/28/2006 2:24:05 PM|rosetta@home|Starting task s026__BOINC_ABRELAX_NEWRELAXFLAGS_hom003__1462_25667_0 using rosetta version 543
12/28/2006 2:32:27 PM|Einstein@Home|Sending scheduler request to http://einstein.phys.uwm.edu/EinsteinAtHome_cgi/cgi
12/28/2006 2:32:27 PM|Einstein@Home|Reason: To report completed tasks
12/28/2006 2:32:27 PM|Einstein@Home|Requesting 17280 seconds of new work, and reporting 1 completed tasks
12/28/2006 2:32:32 PM|Einstein@Home|Scheduler request failed: HTTP file not found
12/28/2006 2:32:32 PM|Einstein@Home|Deferring scheduler requests for 1 hours, 54 minutes and 57 seconds
12/28/2006 3:12:38 PM||Suspending computation - user is active
12/28/2006 3:12:38 PM|World Community Grid|Pausing task faah1117_d138n601_x1MEU_00_1 (left in memory)
12/28/2006 3:12:38 PM|rosetta@home|Pausing task s026__BOINC_ABRELAX_NEWRELAXFLAGS_hom003__1462_25667_0 (left in memory)
12/28/2006 3:12:38 PM||Suspending network activity - user is active
12/28/2006 3:12:59 PM|rosetta@home|Unrecoverable error for result s026__BOINC_ABRELAX_NEWRELAXFLAGS_hom003__1462_25667_0 ( - exit code 1073807364 (0x40010004))
12/28/2006 3:12:59 PM|rosetta@home|Deferring scheduler requests for 1 minutes and 0 seconds
12/28/2006 3:12:59 PM||Rescheduling CPU: application exited
12/28/2006 3:12:59 PM|rosetta@home|Computation for task s026__BOINC_ABRELAX_NEWRELAXFLAGS_hom003__1462_25667_0 finished


ID: 33957 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Michael.L

Send message
Joined: 12 Nov 06
Posts: 67
Credit: 31,295
RAC: 0
Message 33976 - Posted: 2 Jan 2007, 23:21:02 UTC
Last modified: 2 Jan 2007, 23:21:55 UTC

02/01/2007 23:10:57|rosetta@home|Unrecoverable error for result FRA_s015_STRUCTURAL_GENOMICS_hom001_4_s015_4_1r89A_IGNORE_THE_REST_4_1472_12_1 (Incorrect function. (0x1) - exit code 1 (0x1))
failed after 15 seconds of run.
amd3200 w'doze sp2 home.
no graphics at the time.

ID: 33976 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Martin P.

Send message
Joined: 26 May 06
Posts: 38
Credit: 168,333
RAC: 0
Message 34008 - Posted: 3 Jan 2007, 13:45:38 UTC - in response to Message 33976.  

02/01/2007 23:10:57|rosetta@home|Unrecoverable error for result FRA_s015_STRUCTURAL_GENOMICS_hom001_4_s015_4_1r89A_IGNORE_THE_REST_4_1472_12_1 (Incorrect function. (0x1) - exit code 1 (0x1))
failed after 15 seconds of run.
amd3200 w'doze sp2 home.
no graphics at the time.


Same problem here with result https://boinc.bakerlab.org/rosetta/result.php?resultid=55257558 (Mac-client).

ID: 34008 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Martin P.

Send message
Joined: 26 May 06
Posts: 38
Credit: 168,333
RAC: 0
Message 34019 - Posted: 3 Jan 2007, 15:48:29 UTC - in response to Message 34008.  


Same problem here with result https://boinc.bakerlab.org/rosetta/result.php?resultid=55257558 (Mac-client).


And another one: https://boinc.bakerlab.org/rosetta/result.php?resultid=55257667

ID: 34019 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5690
Credit: 5,859,226
RAC: 14
Message 34027 - Posted: 3 Jan 2007, 17:34:44 UTC
Last modified: 3 Jan 2007, 17:40:31 UTC

Whats this all about? Everything has been running fine and then this happens.

3 mins and a severe crash? thats odd

1/3/2007 5:51:55 PM|rosetta@home|Starting task FRA_s015_STRUCTURAL_GENOMICS_hom001_4_s015_4_1no5A_IGNORE_THE_REST_232_1472_12_0 using rosetta version 543


1/3/2007 5:54:12 PM|rosetta@home|Unrecoverable error for result FRA_s015_STRUCTURAL_GENOMICS_hom001_4_s015_4_1no5A_IGNORE_THE_REST_232_1472_12_0 (Incorrect function. (0x1) - exit code 1 (0x1))


1/3/2007 5:54:12 PM|rosetta@home|Computation for task FRA_s015_STRUCTURAL_GENOMICS_hom001_4_s015_4_1no5A_IGNORE_THE_REST_232_1472_12_0 finished


From the results webpage:
55151776 49007970 2 Jan 2007 19:12:02 UTC 3 Jan 2007 16:55:17 UTC Over Client error Compute error 134.88 0.36

For result# 55151776 this error message is posted on the page:

<core_client_version>5.4.11</core_client_version>
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
# random seed: 3027009
ERROR:: Exit at: .refold.cc line:337

</stderr_txt>
ID: 34027 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BennyRop

Send message
Joined: 17 Dec 05
Posts: 555
Credit: 140,800
RAC: 0
Message 34044 - Posted: 3 Jan 2007, 22:20:34 UTC

Running 5.43 - and noticing days where nothing has been uploaded. This hasn't uploaded anything since Dec 30 according to the logs; and there's no mention of communication problems while trying to upload data.

i.e. 5.43 seems to have halted working for a day or days at a time since Dec 14th on this system.




ID: 34044 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
BennyRop

Send message
Joined: 17 Dec 05
Posts: 555
Credit: 140,800
RAC: 0
Message 34124 - Posted: 4 Jan 2007, 22:41:06 UTC

a little more info:

-------

2006-12-14 12:36:37 [rosetta@home] Reason: To report completed tasks
2006-12-14 12:36:37 [rosetta@home] Reporting 1 tasks
2006-12-14 12:36:43 [rosetta@home] Scheduler request succeeded
2006-12-14 16:14:05 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
2006-12-14 16:14:05 [rosetta@home] Reason: To fetch work
2006-12-14 16:14:05 [rosetta@home] Requesting 8640 seconds of new work
2006-12-14 16:14:10 [rosetta@home] Scheduler request succeeded
2006-12-14 16:14:12 [rosetta@home] Started download of file hom001_s014_.fasta.gz
2006-12-14 16:14:12 [rosetta@home] Started download of file hom001_s014_.psipred_ss2.gz
2006-12-14 16:14:15 [rosetta@home] Finished download of file hom001_s014_.fasta.gz
2006-12-14 16:14:15 [rosetta@home] Throughput 61 bytes/sec
2006-12-14 16:14:15 [rosetta@home] Started download of file boinc_hom001_aas014_03_05.200_v1_3.gz
2006-12-14 16:14:16 [rosetta@home] Finished download of file hom001_s014_.psipred_ss2.gz
2006-12-14 16:14:16 [rosetta@home] Throughput 426 bytes/sec
2006-12-14 16:14:16 [rosetta@home] Started download of file boinc_hom001_aas014_09_05.200_v1_3.gz
2006-12-14 16:14:35 [rosetta@home] Finished download of file boinc_hom001_aas014_09_05.200_v1_3.gz
2006-12-14 16:14:35 [rosetta@home] Throughput 11669 bytes/sec
2006-12-14 16:14:35 [rosetta@home] Started download of file mapback_hom017_S_00002_0010496_0.pdb.gz
2006-12-14 16:14:43 [rosetta@home] Finished download of file boinc_hom001_aas014_03_05.200_v1_3.gz
2006-12-14 16:14:43 [rosetta@home] Throughput 29957 bytes/sec
2006-12-14 16:14:43 [rosetta@home] Finished download of file mapback_hom017_S_00002_0010496_0.pdb.gz
2006-12-14 16:14:43 [rosetta@home] Throughput 1002 bytes/sec
2006-12-14 16:14:43 [rosetta@home] Started download of file mapback_hom017_S_00002_0010496_0.loopfile.gz
2006-12-14 16:14:43 [rosetta@home] Started download of file mapback_hom017_S_00002_0010496_0.obligate_loopfile.gz
2006-12-14 16:14:46 [rosetta@home] Finished download of file mapback_hom017_S_00002_0010496_0.loopfile.gz
2006-12-14 16:14:46 [rosetta@home] Throughput 63 bytes/sec
2006-12-14 16:14:46 [rosetta@home] Finished download of file mapback_hom017_S_00002_0010496_0.obligate_loopfile.gz
2006-12-14 16:14:46 [rosetta@home] Throughput 47 bytes/sec
2006-12-14 16:14:47 [---] Rescheduling CPU: files downloaded
2006-12-15 14:27:05 [---] Suspending computation - running CPU benchmarks
2006-12-15 14:27:05 [rosetta@home] Pausing task s018__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom014_S_00018_0003425_0_1446_49_0 (left in memory)
2006-12-15 14:27:05 [---] Suspending network activity - running CPU benchmarks
2006-12-15 14:27:08 [---] Running CPU benchmarks
2006-12-15 14:28:06 [---] Benchmark results:
2006-12-15 14:28:06 [---] Number of CPUs: 1
2006-12-15 14:28:06 [---] 1898 floating point MIPS (Whetstone) per CPU
2006-12-15 14:28:06 [---] 3506 integer MIPS (Dhrystone) per CPU
2006-12-15 14:28:06 [---] Finished CPU benchmarks
2006-12-15 14:28:08 [---] Resuming computation
2006-12-15 14:28:08 [---] Rescheduling CPU: Resuming computation
2006-12-15 14:28:08 [---] Resuming network activity
2006-12-15 14:28:08 [rosetta@home] Resuming task s018__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom014_S_00018_0003425_0_1446_49_0 using rosetta version 543
2006-12-17 12:00:37 [rosetta@home] Aborting task s018__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom014_S_00018_0003425_0_1446_49_0: exceeded CPU time limit 263980.263158
2006-12-17 12:00:37 [rosetta@home] Unrecoverable error for result s018__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom014_S_00018_0003425_0_1446_49_0 (Maximum CPU time exceeded)
2006-12-17 12:00:37 [rosetta@home] Deferring scheduler requests for 1 minutes and 0 seconds
2006-12-17 12:00:43 [---] Rescheduling CPU: application exited
2006-12-17 12:00:43 [rosetta@home] Computation for task s018__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom014_S_00018_0003425_0_1446_49_0 finished
2006-12-17 12:00:43 [rosetta@home] Starting task s014__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom017_S_00002_0010496_0_1447_26_0 using rosetta version 543
2006-12-17 14:24:41 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
2006-12-17 14:24:41 [rosetta@home] Reason: To report completed tasks
2006-12-17 14:24:41 [rosetta@home] Reporting 1 tasks
2006-12-17 14:24:46 [rosetta@home] Scheduler request succeeded
2006-12-17 15:43:52 [rosetta@home] Task s014__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom017_S_00002_0010496_0_1447_26_0 exited with zero status but no 'finished' file
2006-12-17 15:43:52 [rosetta@home] If this happens repeatedly you may need to reset the project.
2006-12-17 15:43:52 [---] Rescheduling CPU: application exited
2006-12-17 15:43:52 [rosetta@home] Restarting task s014__BOINC_LOOP_RELAX_IGNORE_THE_REST_hom001__IGNORE_THE_REST_mapback_hom017_S_00002_0010496_0_1447_26_0 using rosetta version 543
------
evidently twiddling it's thumbs until the reboot on the 17th.

------

2006-12-29 14:47:20 [rosetta@home] Reporting 1 tasks
2006-12-29 14:47:25 [rosetta@home] Scheduler request succeeded
2006-12-30 10:09:20 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
2006-12-30 10:09:20 [rosetta@home] Reason: To fetch work
2006-12-30 10:09:20 [rosetta@home] Requesting 175 seconds of new work
2006-12-30 10:09:25 [rosetta@home] Scheduler request succeeded
2006-12-30 10:09:27 [---] Rescheduling CPU: files downloaded
2006-12-30 12:37:24 [---] Rescheduling CPU: application exited
2006-12-30 12:37:24 [rosetta@home] Computation for task 1mmm_1_NMRREF_1_1mmm_1_id_model_12IGNORE_THE_REST_idl_1470_4638_1 finished
2006-12-30 12:37:25 [rosetta@home] Starting task 1mmm_1_NMRREF_1_1mmm_1_id_model_12IGNORE_THE_REST_idl_1470_8433_0 using rosetta version 543
2006-12-30 12:37:26 [rosetta@home] Started upload of file 1mmm_1_NMRREF_1_1mmm_1_id_model_12IGNORE_THE_REST_idl_1470_4638_1_0
2006-12-30 12:37:51 [rosetta@home] Finished upload of file 1mmm_1_NMRREF_1_1mmm_1_id_model_12IGNORE_THE_REST_idl_1470_4638_1_0
2006-12-30 12:37:51 [rosetta@home] Throughput 39040 bytes/sec
2006-12-30 14:30:11 [---] Suspending computation - running CPU benchmarks
2006-12-30 14:30:11 [rosetta@home] Pausing task 1mmm_1_NMRREF_1_1mmm_1_id_model_12IGNORE_THE_REST_idl_1470_8433_0 (left in memory)
2006-12-30 14:30:11 [---] Suspending network activity - running CPU benchmarks
2006-12-30 14:30:14 [---] Running CPU benchmarks
2006-12-30 14:31:13 [---] Benchmark results:
2006-12-30 14:31:13 [---] Number of CPUs: 1
2006-12-30 14:31:13 [---] 1888 floating point MIPS (Whetstone) per CPU
2006-12-30 14:31:13 [---] 3480 integer MIPS (Dhrystone) per CPU
2006-12-30 14:31:13 [---] Finished CPU benchmarks
2006-12-30 14:31:15 [---] Resuming computation
2006-12-30 14:31:15 [---] Rescheduling CPU: Resuming computation
2006-12-30 14:31:15 [---] Resuming network activity
2006-12-30 14:31:15 [rosetta@home] Resuming task 1mmm_1_NMRREF_1_1mmm_1_id_model_12IGNORE_THE_REST_idl_1470_8433_0 using rosetta version 543
2006-12-30 15:01:55 [rosetta@home] Sending scheduler request to https://boinc.bakerlab.org/rosetta_cgi/cgi
2006-12-30 15:01:55 [rosetta@home] Reason: To report completed tasks
2006-12-30 15:01:55 [rosetta@home] Reporting 1 tasks
2006-12-30 15:02:00 [rosetta@home] Scheduler request succeeded
2007-01-02 19:31:32 [---] Suspending work fetch because computer is overcommitted.
2007-01-02 20:31:29 [---] Using earliest-deadline-first scheduling because computer is overcommitted.
2007-01-04 13:18:39 [---] Exit requested by user
2007-01-04 13:18:45 [---] Rescheduling CPU: exit_tasks

To pause/resume tasks hit CTRL-C, to exit hit CTRL-BREAK

StartServiceCtrlDispatcher being called.
This may take several seconds. Please wait.
2007-01-04 13:20:58 [---] Starting BOINC client version 5.4.9 for windows_intelx86
2007-01-04 13:20:58 [---] libcurl/7.15.3 OpenSSL/0.9.8a zlib/1.2.3
2007-01-04 13:20:58 [---] Executing as a daemon
2007-01-04 13:20:58 [---] Data directory: C:Program FilesBOINC
2007-01-04 13:20:58 [---] BOINC is running as a service and as a non-system user.
2007-01-04 13:20:58 [---] No application graphics will be available.
2007-01-04 13:20:59 [---] Processor: 1 AuthenticAMD AMD Athlon(tm) 64 Processor 3000+
2007-01-04 13:20:59 [---] Memory: 1023.48 MB physical, 1.65 GB virtual
2007-01-04 13:20:59 [---] Disk: 29.29 GB total, 7.25 GB free
2007-01-04 13:20:59 [rosetta@home] URL: https://boinc.bakerlab.org/rosetta/; Computer ID: 121218; location: home; project prefs: default
2007-01-04 13:20:59 [ralph@home] URL: http://ralph.bakerlab.org/; Computer ID: 2698; location: home; project prefs: default
2007-01-04 13:20:59 [---] General prefs: from rosetta@home (last modified 2006-05-11 00:15:56)
2007-01-04 13:20:59 [---] General prefs: no separate prefs for home; using your defaults
2007-01-04 13:21:02 [---] Local control only allowed
2007-01-04 13:21:02 [---] Listening on port 31416
2007-01-04 13:21:02 [rosetta@home] Resuming task 1mmm_1_NMRREF_1_1mmm_1_id_model_12IGNORE_THE_REST_idl_1470_8433_0 using rosetta version 543
2007-01-04 13:21:22 [---] Using earliest-deadline-first scheduling because computer is overcommitted.
2007-01-04 13:21:22 [---] Suspending work fetch because computer is overcommitted.
-----
Boincmgr is claiming that the task is 64.85% done. But never finishes..

It had registered 15 hrs of work since it started on the 30th.. although it has been on 24/7.
ID: 34124 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile hedera
Avatar

Send message
Joined: 15 Jul 06
Posts: 76
Credit: 5,230,082
RAC: 212
Message 34145 - Posted: 5 Jan 2007, 5:12:52 UTC

I just had the same error as Splatshot - fastest crash I've ever seen. I was crunching away at WU 49380569 (PSH_0101_looprlx_GP120_OD1_138_146_0623_1478_12_0). I selected the task and clicked "Show Graphics" and when I clicked on the graphics window to close it, it was hung. I killed the window hard, and found I or something had also killed the task:

01/04/2007 8:51:07 PM|rosetta@home|Computation for task s019__BOINC_ABRELAX_NEWRELAXFLAGS_hom007__1458_42900_0 finished
01/04/2007 8:51:07 PM||Starting PSH_0101_looprlx_GP120_OD1_138_146_0623_1478_12_0
01/04/2007 8:51:07 PM|rosetta@home|Starting task PSH_0101_looprlx_GP120_OD1_138_146_0623_1478_12_0 using rosetta version 543
01/04/2007 8:51:09 PM|rosetta@home|[file_xfer] Started upload of file s019__BOINC_ABRELAX_NEWRELAXFLAGS_hom007__1458_42900_0_0
01/04/2007 8:51:12 PM|rosetta@home|[file_xfer] Finished upload of file s019__BOINC_ABRELAX_NEWRELAXFLAGS_hom007__1458_42900_0_0
01/04/2007 8:51:12 PM|rosetta@home|[file_xfer] Throughput 34571 bytes/sec

01/04/2007 9:03:48 PM|rosetta@home|Unrecoverable error for result PSH_0101_looprlx_GP120_OD1_138_146_0623_1478_12_0 ( - exit code 1073807364 (0x40010004))

01/04/2007 9:03:48 PM|rosetta@home|Deferring scheduler requests for 1 minutes and 0 seconds
01/04/2007 9:03:48 PM|rosetta@home|Computation for task PSH_0101_looprlx_GP120_OD1_138_146_0623_1478_12_0 finished
01/04/2007 9:04:52 PM|rosetta@home|Sending scheduler request: To fetch work
01/04/2007 9:04:52 PM|rosetta@home|Requesting 9317 seconds of new work, and reporting 2 completed tasks
01/04/2007 9:04:57 PM|rosetta@home|Scheduler RPC succeeded [server version 505]
01/04/2007 9:04:57 PM|rosetta@home|Deferring scheduler requests for 4 minutes and 2 seconds
01/04/2007 9:04:59 PM|rosetta@home|[file_xfer] Started download of file 1iibA_OEG.bar

... and as you see, it's downloading another one. It would be nice if we could kill the graphic display without affecting the workunit.


--hedera

Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic.

ID: 34145 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile hedera
Avatar

Send message
Joined: 15 Jul 06
Posts: 76
Credit: 5,230,082
RAC: 212
Message 34146 - Posted: 5 Jan 2007, 5:15:07 UTC

And here is the Windows event log error for my graphics crash:

Event Type: Error
Event Source: Application Hang
Event Category: (101)
Event ID: 1002
Date: 01/04/2007
Time: 9:03:47 PM
User: N/A
Computer: KAREN_8400
Description:
Hanging application rosetta_5.43_windows_intelx86.exe, version 0.0.0.0, hang module hungapp, version 0.0.0.0, hang address 0x00000000.

For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Data:
0000: 41 70 70 6c 69 63 61 74 Applicat
0008: 69 6f 6e 20 48 61 6e 67 ion Hang
0010: 20 20 72 6f 73 65 74 74 rosett
0018: 61 5f 35 2e 34 33 5f 77 a_5.43_w
0020: 69 6e 64 6f 77 73 5f 69 indows_i
0028: 6e 74 65 6c 78 38 36 2e ntelx86.
0030: 65 78 65 20 30 2e 30 2e exe 0.0.
0038: 30 2e 30 20 69 6e 20 68 0.0 in h
0040: 75 6e 67 61 70 70 20 30 ungapp 0
0048: 2e 30 2e 30 2e 30 20 61 .0.0.0 a
0050: 74 20 6f 66 66 73 65 74 t offset
0058: 20 30 30 30 30 30 30 30 0000000
0060: 30 0

--hedera

Never be afraid to try something new. Remember that amateurs built the ark. Professionals built the Titanic.

ID: 34146 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Conan
Avatar

Send message
Joined: 11 Oct 05
Posts: 150
Credit: 4,120,022
RAC: 3,429
Message 34154 - Posted: 5 Jan 2007, 10:39:45 UTC

> Had a problem with this result
https://boinc.bakerlab.org/rosetta/result.php?resultid=54340841

It had stopped doing anything for over a day. The CPU time was not advancing and neither was the progress counters. Being a multicore machine I could see that the other cores were processing just fine but this one was doing nothing.
Machine is running Linux and has no graphics.
ID: 34154 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sam Miorelli

Send message
Joined: 16 Feb 06
Posts: 7
Credit: 1,303,044
RAC: 0
Message 34171 - Posted: 5 Jan 2007, 18:47:35 UTC

I had the same problem as Conan on my PowerMac G4 2x1Ghz machine on this workunit:
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=48934145

After 2 days of it not getting past 11 seconds of CPU time on the second processor with the first one chugging along on SETI without problems I killed it.

Also, the next workunit it downloaded crashed with this exit code:
Fri Jan 5 11:30:50 2007|rosetta@home|Unrecoverable error for result PSH_0111_looprlx_GP120_OD1_138_146_2767_1478_4_0 (process exited with code 131 (0x83))

Here's that workunit's URL:
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=49420006
ID: 34171 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile sslickerson

Send message
Joined: 14 Oct 05
Posts: 101
Credit: 578,497
RAC: 0
Message 34264 - Posted: 7 Jan 2007, 8:33:35 UTC

This WU has failed twice in the last few days.
ID: 34264 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile James Box

Send message
Joined: 18 Oct 06
Posts: 1
Credit: 44,267
RAC: 0
Message 34288 - Posted: 7 Jan 2007, 15:33:51 UTC
Last modified: 7 Jan 2007, 15:35:03 UTC

No graphics, but this one crashed on Linux:

2007-01-07 05:30:08 [rosetta@home] Unrecoverable error for result PSH_0071_looprlx_GP120_OD1_138_147_1848_1479_29_0 (process got signal 11)

Result ID 55872047
ID: 34288 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ensor
Avatar

Send message
Joined: 7 Jan 07
Posts: 6
Credit: 27,111
RAC: 0
Message 34390 - Posted: 8 Jan 2007, 21:41:15 UTC


Had a problem earlier today on this v5.43 WU.

I was away from the machine when it happened, but it looks like it may have been a graphics related problem - when I brought the monitor out of standby I was greeted with the Rosetta screen saver and was unable to get back to the desktop (or enter the BOINC manager, or any other program) until I manually cancelled the Rosetta process using the Windows task manager.

Other than this the machine was running completely normally.

Unfortunately, the BOINC logs show nothing useful....other than the Rosetta process starting at 13:08 and hogging the machine until I manually cancelled it at 17:40.

I also run eMule on this machine and it finished downloading a file at 14:40. The remains of the info window which eMule displays when it finishes a download were still visible in the lower right hand corner of the screen....

In common with other people in this thread, I got an "Application Hang" message in the windows error log when I cancelled the process.

My graphics card is an "NVIDIA GeForce4 MX 420" with latest drivers installed.


TTFN - Pete.


ID: 34390 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · 7 · 8 · Next

Message boards : Number crunching : Problems with Rosetta version 5.43



©2024 University of Washington
https://www.bakerlab.org