minirosetta 2.17

Profile Greg_BE
Joined: 30 May 06
Posts: 4871
Credit: 3,744,984
RAC: 2,340
Message 70201 - Posted: 1 May 2011, 13:11:04 UTC - in response to Message 70198.  

Well, I guess we will have to wait for the grad student to wake up and come on duty to fully address your question. I found a little something on the BOINC wiki that addresses this issue:

Why am I getting a 'Reason: Access Violation (0xc0000005)' error?

1. Change your preferences to leave Rosetta@Home in memory: log in, go to General Preferences -> Edit Preferences (down the bottom) -> "Leave applications in memory while preempted?", check yes, and click the update preferences button. Also remember to "update" the BOINC client software so that the changes are downloaded: open the BOINC Manager, select the "Projects" tab, left-click on "Rosetta@home" to select the project, and click the "Update" button.
2. An error occurred somewhere on the computer; it could have been the BOINC client software, the Rosetta@Home science application, or any other programme your computer was running at the time. This is not a Rosetta@Home-specific error. As far as I am aware it happens, on occasion, in all of the BOINC-powered projects with all of the science applications. Keep Rosetta@Home in memory and ignore this problem if it's not getting out of hand.

I'm going to leave it at that... wait for the big experts.

Yep, task completed after I restarted BOINC - put into snooze, then shut down and started (in that order)??

Despite the heartbeat issues, you did complete the task:

DONE :: 56 starting structures 14704.5 cpu seconds
This process generated 56 decoys from 56 attempts


So what's with the following

No heartbeat from core client for 30 sec - exiting
messages ?

Job had been sitting doing NOTHING for 13.5 hours (???) which I noticed and subsequently restarted BOINC.

The Windows XP PC concerned is using nVidia onboard graphics (no idea if this has any bearing)

http://boinc.bakerlab.org/rosetta/result.php?resultid=419011804

<core_client_version>6.10.58</core_client_version>
<![CDATA[
<stderr_txt>
[2011- 4-30 2:38:23:] :: BOINC:: Initializing ... ok.
[2011- 4-30 2:38:23:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev39052.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/casd_sr10_boinc_nmr_control.1ff3B_20_abrelax_cs_frags_tex.boinc.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
# cpu_run_time_pref: 28800


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x7C910F1E read attempt to address 0x00000001

Engaging BOINC Windows Runtime Debugger...

[2011- 4-30 21:39:36:] :: BOINC:: Initializing ... ok.
[2011- 4-30 21:39:36:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev39052.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/casd_sr10_boinc_nmr_control.1ff3B_20_abrelax_cs_frags_tex.boinc.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
Continuing computation from checkpoint: chk_S_00046_FragmentSampler__stage1 ... success!
Continuing computation from checkpoint: chk_S_00046_FragmentSampler__stage2 ... success!
Continuing computation from checkpoint: chk_S_00046_FragmentSampler__stage3 ... success!
Continuing computation from checkpoint: chk_S_00046_FragmentSampler__stage4_kk_1 ... success!
Continuing computation from checkpoint: chk_S_00046_FragmentSampler__stage4_kk_2 ... success!
# cpu_run_time_pref: 28800
No heartbeat from core client for 30 sec - exiting
[2011- 4-30 22: 6:36:] :: BOINC:: Initializing ... ok.
[2011- 4-30 22: 6:36:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev39052.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/casd_sr10_boinc_nmr_control.1ff3B_20_abrelax_cs_frags_tex.boinc.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage1 ... success!
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage2 ... success!
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage3 ... success!
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage4_kk_1 ... success!
# cpu_run_time_pref: 28800
No heartbeat from core client for 30 sec - exiting
[2011- 4-30 22: 7:11:] :: BOINC:: Initializing ... ok.
[2011- 4-30 22: 7:11:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev39052.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/casd_sr10_boinc_nmr_control.1ff3B_20_abrelax_cs_frags_tex.boinc.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage1 ... success!
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage2 ... success!
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage3 ... success!
Continuing computation from checkpoint: chk_S_00052_FragmentSampler__stage4_kk_1 ... success!
# cpu_run_time_pref: 28800
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk1_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk2_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk3_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk4_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk5_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk6_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk7_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk8_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk9_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk10_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk11_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk12_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk13_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk14_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk15_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk16_fa ... success!
Continuing computation from checkpoint: chk_S_00052_FastRelax__chk17_fa ... success!
No heartbeat from core client for 30 sec - exiting
[2011- 4-30 22: 8:47:] :: BOINC:: Initializing ... ok.
[2011- 4-30 22: 8:47:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev39052.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/casd_sr10_boinc_nmr_control.1ff3B_20_abrelax_cs_frags_tex.boinc.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
# cpu_run_time_pref: 28800
No heartbeat from core client for 30 sec - exiting
[2011- 4-30 22:17:41:] :: BOINC:: Initializing ... ok.
[2011- 4-30 22:17:41:] :: BOINC :: boinc_init()
BOINC:: Setting up shared resources ... ok.
BOINC:: Setting up semaphores ... ok.
BOINC:: Updating status ... ok.
BOINC:: Registering timer callback... ok.
BOINC:: Worker initialized successfully.
Registering options..
Registered extra options.
Initializing broker options ...
Registered extra options.
Initializing core...
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Initializing options.... ok
Options::initialize()
Options::adding_options()
Options::initialize() Check specs.
Options::initialize() End reached
Loaded options.... ok
Processed options.... ok
Initializing random generators... ok
Initialization complete.
Setting WU description ...
Unpacking zip data: ../../projects/boinc.bakerlab.org_rosetta/minirosetta_database_rev39052.zip
Unpacking WU data ...
Unpacking data: ../../projects/boinc.bakerlab.org_rosetta/casd_sr10_boinc_nmr_control.1ff3B_20_abrelax_cs_frags_tex.boinc.zip
Setting database description ...
Setting up checkpointing ...
Setting up graphics native ...
BOINC:: Worker startup.
Starting watchdog...
Watchdog active.
# cpu_run_time_pref: 28800
======================================================
DONE :: 56 starting structures 14704.5 cpu seconds
This process generated 56 decoys from 56 attempts
======================================================

BOINC :: Watchdog shutting down...
BOINC :: BOINC support services shutting down cleanly ...
called boinc_finish

</stderr_txt>
]]>



ID: 70201 · Rating: 0
Profile Ian_D

Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 70205 - Posted: 1 May 2011, 14:50:41 UTC - in response to Message 70201.  

Cheers Greg !

ID: 70205 · Rating: 0
Snags

Joined: 22 Feb 07
Posts: 198
Credit: 1,954,298
RAC: 966
Message 70210 - Posted: 1 May 2011, 17:27:12 UTC - in response to Message 70201.  

The "no heartbeat" message means the science app and the BOINC client lost contact with each other. When the science application doesn't receive the heartbeat (the "I'm alive" message) from BOINC, it is supposed to exit. As long as it was merely a temporary obstruction and BOINC hasn't actually crashed, BOINC should see that the application has stopped, restart it, and proceed merrily on its way. Only when it happens repeatedly with a single task (100 times) does BOINC give up, sending that task back and starting a brand new task. If I'm reading correctly, the "no heartbeat" messages occurred after you had restarted BOINC, and Rosetta was able to successfully complete the task despite them. They may or may not be related to the cause of the error Greg highlighted, which may have led to a BOINC crash that it couldn't recover from without a restart; hence the long delay until you noticed, restarted, and set BOINC and Rosetta on their merry way again.
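
To make the heartbeat logic above concrete, here is a minimal Python sketch. It is not BOINC's actual code: the function names are hypothetical, the exact comparison (strictly more than the timeout) is an assumption, and the only numbers taken from this thread are the 30-second timeout from the log message and the 100-restart limit mentioned above.

```python
HEARTBEAT_TIMEOUT = 30  # seconds, from "No heartbeat from core client for 30 sec"
MAX_RESTARTS = 100      # restart limit mentioned in the post above


def app_should_exit(now, last_heartbeat, timeout=HEARTBEAT_TIMEOUT):
    """Science-app side: exit once no heartbeat has arrived for more than `timeout` seconds."""
    return now - last_heartbeat > timeout


def client_action(restart_count, max_restarts=MAX_RESTARTS):
    """Client side: keep restarting the task until the limit is reached, then give up."""
    return "restart" if restart_count < max_restarts else "abandon"


def simulate(heartbeat_times, end_time):
    """Step a clock one second at a time; return the second at which the app exits, or None."""
    last = 0
    pending = sorted(heartbeat_times)
    for now in range(end_time + 1):
        # Consume every heartbeat that has arrived by this second.
        while pending and pending[0] <= now:
            last = pending.pop(0)
        if app_should_exit(now, last):
            return now
    return None


# Heartbeats arrive every second for 10 seconds, then the client goes quiet.
print(simulate(range(0, 11), 60))   # exit time in seconds
# Heartbeats never stop, so the app keeps running.
print(simulate(range(0, 61), 60))
```

In this toy model a short stall (a disk busy with paging, a scanner hogging the machine) is enough to trip the timeout, which is why the client restarting the task is the normal recovery path.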

You might try to recall what else was running on your computer at the time of the "no heartbeat" messages (22:6:36, 22:7:11, 22:8:47, 22:17:41). Anti-virus, anti-spyware, some other maintenance-type scan, indexing? It could be something you started deliberately or something running automatically in the background. I don't suppose you started some new process (indexing, say) between 2:38:23 and the time BOINC stopped? (If BOINC hadn't been running for 13.5 hours when you restarted, it must have stopped at about 8:00. Is that right?) That could point to the cause of the crash and, if the process was ongoing (or maybe set to check for changes, like an index or a backup), could also explain the "no heartbeat" messages.
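
For anyone who wants to pull those times out of a stderr dump rather than eyeball them, here is a rough Python sketch. The helper names are hypothetical, the timestamp pattern is only inferred from the lines in the posted log, and the last step assumes the 21:39:36 entry is the manual restart that ended the 13.5 hour stall.

```python
import re
from datetime import datetime, timedelta

# Timestamps in the posted stderr look like "[2011- 4-30 22: 6:36:]" (space-padded fields).
STAMP = re.compile(r"\[(\d{4})-\s*(\d+)-\s*(\d+)\s+(\d+):\s*(\d+):\s*(\d+):\]")


def parse_stamp(line):
    """Return the datetime embedded in a log line, or None if the line has no timestamp."""
    match = STAMP.search(line)
    if match is None:
        return None
    return datetime(*map(int, match.groups()))


def restarts_after_heartbeat_loss(lines):
    """Return the first timestamp logged after each 'No heartbeat' exit."""
    found = []
    lost = False
    for line in lines:
        if "No heartbeat from core client" in line:
            lost = True
            continue
        stamp = parse_stamp(line)
        if stamp is not None and lost:
            found.append(stamp)
            lost = False
    return found


# A few lines copied from the log posted earlier in this thread.
sample_log = [
    "[2011- 4-30 21:39:36:] :: BOINC:: Initializing ... ok.",
    "[2011- 4-30 21:39:36:] :: BOINC :: boinc_init()",
    "No heartbeat from core client for 30 sec - exiting",
    "[2011- 4-30 22: 6:36:] :: BOINC:: Initializing ... ok.",
    "[2011- 4-30 22: 6:36:] :: BOINC :: boinc_init()",
    "No heartbeat from core client for 30 sec - exiting",
    "[2011- 4-30 22: 7:11:] :: BOINC:: Initializing ... ok.",
]

restarts = restarts_after_heartbeat_loss(sample_log)
for stamp in restarts:
    print(stamp.strftime("%H:%M:%S"))

# Assumption: 21:39:36 is the manual restart, 13.5 hours after the task stalled.
manual_restart = parse_stamp(sample_log[0])
stalled_since = manual_restart - timedelta(hours=13.5)
print(stalled_since.strftime("%H:%M:%S"))  # 08:09:36
```

Under that assumption the stall would have begun a little after 8:00, which lines up with the "about 8" estimate above.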


Best,
Snags
ID: 70210 · Rating: 0
TPCBF

Joined: 29 Nov 10
Posts: 108
Credit: 3,951,406
RAC: 1,310
Message 70211 - Posted: 1 May 2011, 21:20:39 UTC

Hey guys, is it really necessary to full quote the same stuff over and over again? :(

Ralf
ID: 70211
Greg_BE
Joined: 30 May 06
Posts: 4871
Credit: 3,744,984
RAC: 2,340
Message 70212 - Posted: 2 May 2011, 0:00:18 UTC - in response to Message 70211.  

Hey guys, is it really necessary to full quote the same stuff over and over again? :(

Ralf



Well, in this case it keeps everything together in one block so we can reference ALL the information: the error messages, initial complaint, possible solutions and information about the error.

This is a small enough thread that it wasn't that big of a deal.
In bigger threads it can be a problem.

I've been around long enough to know how some of us, as a joke, created a thread so long by just replying to the same quote time after time. Mod remembers this.
So this is just a piddly thread.
ID: 70212
Ian_D
Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 70214 - Posted: 2 May 2011, 9:23:19 UTC

Think I may have "solved" this one and, as you so rightly said, it looks like it was a hardware problem. Looking at System info messages, I've been getting a lot of intermittent paging problems to one of the hard disks around the times of the Reason: Access Violation (0xc0000005) failures.

Cheers for the steer !

Ian


ID: 70214
TimL
Joined: 16 Sep 06
Posts: 16
Credit: 12,386,138
RAC: 1
Message 70217 - Posted: 2 May 2011, 10:24:30 UTC

FOLD_N_DOCK_YgaP_D2symm_2_SAVE_ALL_OUT_IGNORE_THE_REST_w_csts_26019_4307_0 ran for over 17 hours (Target run time = 8 hours) then failed with this error.
ID: 70217
robertmiles
Joined: 16 Jun 08
Posts: 689
Credit: 9,700,681
RAC: 4,643
Message 70221 - Posted: 2 May 2011, 15:36:37 UTC - in response to Message 70210.  
Last modified: 2 May 2011, 16:07:29 UTC

Well guess we will have to wait for the Grad student to wake up


The "no heartbeat" message means the science app and BOINC client lost contact with each other. When the science application doesn't receive the heartbeat (the "I'm alive") message from BOINC it is supposed to exit. As long as it was merely a temporary obstruction and BOINC hasn't actually crashed, it should see that the application has stopped, restart it and proceed merrily on its way. Only when it happens repeatedly with a single task (100 times) does BOINC give up, sending that task back and starting a brand new task. If I'm reading correctly, the "no heartbeat" messages occurred after you had restarted BOINC, and Rosetta was able to successfully complete the task despite them. They may or may not be related to the cause of the error Greg highlighted, which may have led to a BOINC crash that it couldn't recover from without a restart; hence the long delay until you noticed, restarted, and set BOINC and Rosetta on their merry way again.

You might try to recall what else was running on your computer at the time of the "no heartbeat" messages (22:6:36, 22:7:11, 22:8:47, 22:17:41). Anti-virus, anti-spyware, some other maintenance type scan, indexing? Could be something you started deliberately or could be something running automatically in the background. I don't suppose you started some new process (indexing, say) between 2:38:23 and the time BOINC stopped (which, if BOINC hadn't been running for 13.5 hours when you restarted, must have been about 8. Is that right?). That could point to the cause of the crash and, if the process was ongoing (or maybe set to check for changes, like an index or a backup), could also explain the "no heartbeat" messages.


Best,
Snags


When I've seen similar error messages, the Norton Internet Security antivirus program was always running in the background (no good way to shut it off other than uninstalling it). Not sure if that's also why I often see the BOINC Manager program lose contact with the rest of BOINC. Do the other people seeing this problem also use Norton Internet Security?
ID: 70221
Ian_D
Joined: 21 Sep 05
Posts: 55
Credit: 4,216,173
RAC: 0
Message 70227 - Posted: 2 May 2011, 22:01:05 UTC - in response to Message 70221.  


When I've seen similar error messages, the Norton Internet Security antivirus program was always running in the background (no good way to shut it off other than uninstalling it). Not sure if that's also why I often see the BOINC Manager program lose contact with the rest of BOINC. Do the other people seeing this problem also use Norton Internet Security?


Nope, will NOT have Norton on any of my machines.


ID: 70227
Jesse Viviano
Joined: 14 Jan 10
Posts: 41
Credit: 1,501,581
RAC: 1
Message 70254 - Posted: 5 May 2011, 23:03:56 UTC

I just got a validate error on work unit 420544516. Could someone please investigate why the validator failed here?
ID: 70254
Jesse Viviano
Joined: 14 Jan 10
Posts: 41
Credit: 1,501,581
RAC: 1
Message 70255 - Posted: 6 May 2011, 3:42:10 UTC - in response to Message 70254.  

I just got a validate error on work unit 420544516. Could someone please investigate why the validator failed here?

Oops! That should be result 420544516. The corresponding work unit number is 383771914.
ID: 70255
Speedy
Joined: 25 Sep 05
Posts: 159
Credit: 598,637
RAC: 0
Message 70256 - Posted: 6 May 2011, 5:21:06 UTC

420656625 (FOLD_N_DOCK_dagk_D2symm) got validate state Invalid after CPU time 2010.416; the run time was meant to be 3 hours. Corresponding work unit number 420591203 got validate state Invalid after CPU time 3843.709 (has debug message).
Have a crunching good day!!
ID: 70256
SafeAggie
Joined: 22 Oct 05
Posts: 3
Credit: 458,414
RAC: 0
Message 70272 - Posted: 7 May 2011, 18:42:48 UTC

Validate Error: ProteinG_abinitio_SAVE_ALL_OUT_design_relax_g056_009_26017_78
wuid=382515464
resultid=419989702


Validate Error: ProteinG_abinitio_SAVE_ALL_OUT_design_relax_g056_010_26017_78
wuid=382515501
resultid=419989703
ID: 70272
.clair.
Joined: 2 Jan 07
Posts: 45
Credit: 13,673,820
RAC: 14,300
Message 70276 - Posted: 7 May 2011, 20:59:48 UTC
Last modified: 7 May 2011, 21:06:00 UTC

Validate error - ProteinG_abinitio_SAVE_ALL_OUT_design_relax_g061_005_26530_180_0
http://boinc.bakerlab.org/rosetta/result.php?resultid=420705463
ID: 70276
Greg_BE
Joined: 30 May 06
Posts: 4871
Credit: 3,744,984
RAC: 2,340
Message 70280 - Posted: 8 May 2011, 7:09:51 UTC

Error Message: - Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x7C812AFB
Wingman also had the same problem with a little longer run time.


Tasks:

FOLD_N_DOCK_2kqt_D2symm_SAVE_ALL_OUT_IGNORE_THE_REST_26674_9746_0
http://boinc.bakerlab.org/rosetta/result.php?resultid=421054105

FOLD_N_DOCK_2kqt_D2symm_SAVE_ALL_OUT_IGNORE_THE_REST_26674_1528_0
http://boinc.bakerlab.org/rosetta/result.php?resultid=420870386

FOLD_N_DOCK_dagk_D2symm_SAVE_ALL_OUT_IGNORE_THE_REST_26520_9259_1
http://boinc.bakerlab.org/rosetta/result.php?resultid=420803687

ID: 70280
.clair.
Joined: 2 Jan 07
Posts: 45
Credit: 13,673,820
RAC: 14,300
Message 70284 - Posted: 8 May 2011, 16:00:24 UTC

Validate error - ProteinG_abinitio_SAVE_ALL_OUT_design_relax_g049_008_26508_177

Both of us that crunched this unit got this error

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=383720482
ID: 70284
robertmiles
Joined: 16 Jun 08
Posts: 689
Credit: 9,700,681
RAC: 4,643
Message 70285 - Posted: 8 May 2011, 16:43:08 UTC
Last modified: 8 May 2011, 16:47:27 UTC

Another workunit that appeared to stop using any CPU time at all shortly after a checkpoint, but BOINC thought it was still running for about 2 more days elapsed:

pred_ECH19_lr19a_189_0003_nh.pdb_26473_588_0

However, it eventually decided that it had gone past a time limit and engaged the BOINC debugger. Could there be a problem with the BOINC debugger announcing that it is finished, and the workunit should be marked as ended?

Also, the listing of my results does not appear to contain any information on which version of minirosetta was used. 2.17 is the latest version, so I'm assuming that one.

Not sure if the Tthrottle add-on I'm using to prevent my computer from overheating has any effect on this problem.
ID: 70285
svincent
Joined: 30 Dec 05
Posts: 208
Credit: 7,412,464
RAC: 1,772
Message 70456 - Posted: 31 May 2011, 1:31:47 UTC

Task 426111314 ( lysozyme_var_quota_8_15_noH_SAVE_ALL_OUT_27153_445_0 ) failed immediately on Mac.

ERROR: ERROR: FragmentIO: could not open file q-noHom.frags.15mers.gz
ERROR:: Exit from: src/core/fragment/FragmentIO.cc line: 258
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

</stderr_txt>
]]>
ID: 70456
.clair.
Joined: 2 Jan 07
Posts: 45
Credit: 13,673,820
RAC: 14,300
Message 70491 - Posted: 2 Jun 2011, 20:35:39 UTC

Compute error after 3 seconds

lysozyme_var_dis_8_15_SAVE_ALL_OUT_27136_429_0

both of us got the same error

ERROR: ERROR: FragmentIO: could not open file cs-lys.15mers.gz
ERROR:: Exit from: src/core/fragment/FragmentIO.cc line: 258
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish

http://boinc.bakerlab.org/rosetta/workunit.php?wuid=388396144
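
Both lysozyme reports in this thread stop at the same FragmentIO line with a fragment file that could not be opened. When comparing several failed tasks, a small script can pull the missing file names out of the stderr text. This is only a sketch in Python: the regular expression is based on the wording of the error lines quoted here, and the function name is made up for illustration.

```python
import re

# Pattern based on the error wording quoted in this thread.
MISSING_FILE = re.compile(r"could not open file (\S+)")


def missing_files(stderr_text):
    """Return the file names reported as unopenable, in order, without repeats."""
    seen = []
    for name in MISSING_FILE.findall(stderr_text):
        if name not in seen:
            seen.append(name)
    return seen


sample = """ERROR: ERROR: FragmentIO: could not open file cs-lys.15mers.gz
ERROR:: Exit from: src/core/fragment/FragmentIO.cc line: 258
BOINC:: Error reading and gzipping output datafile: default.out
called boinc_finish
"""

print(missing_files(sample))  # prints ['cs-lys.15mers.gz']
```

If the same file name shows up for every wingman, that would point at the work unit's input files rather than at any one computer.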
ID: 70491
Dean Costello
Joined: 8 Feb 11
Posts: 4
Credit: 12,463,863
RAC: 6,201
Message 70520 - Posted: 8 Jun 2011, 22:55:42 UTC

Hello,
I hate to leave this message because it seems like a problem that has already been answered somewhere, but I can't find it.

Here's the thing: I get the following error on my new iMac running a new version of BOINC (6.12.26):

Wed Jun 8 18:35:23 2011 | rosetta@home | Sending scheduler request: Requested by user.
Wed Jun 8 18:35:23 2011 | rosetta@home | Reporting 27 completed tasks, requesting new tasks for CPU
Wed Jun 8 18:35:23 2011 | | [error] Can't create HTTP response output file sched_reply_boinc.bakerlab.org_rosetta.xml
Wed Jun 8 18:35:23 2011 | rosetta@home | Scheduler request initialization failed: fopen() failed

Any ideas on this?
-
Dean Costello
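
One possible cause of an fopen() failure like this (an assumption; nothing in the thread confirms it) is that the client cannot create files in its data directory, for example because of permissions or a full disk. A quick way to test that from Python is sketched below. The BOINC_DATA_DIR environment variable is hypothetical and only used here to pass in the directory; substitute the actual data directory of your installation.

```python
import os
import tempfile


def can_create_file(directory):
    """Try to create and remove a temporary file in `directory`."""
    try:
        with tempfile.NamedTemporaryFile(dir=directory):
            pass
        return True
    except OSError:
        # Covers a missing directory, lack of permission, full disk, etc.
        return False


if __name__ == "__main__":
    # Hypothetical variable: point it at your real BOINC data directory.
    data_dir = os.environ.get("BOINC_DATA_DIR", ".")
    print(data_dir, "writable:", can_create_file(data_dir))
```

Run it as the same user that runs the BOINC client, since a directory can be writable for one account and not for another.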
ID: 70520

©2020 University of Washington
http://www.bakerlab.org