Client errors

Message boards : Number crunching : Client errors

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 . . . 7 · Next

AuthorMessage
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1224
Credit: 13,848,401
RAC: 2,043
Message 74993 - Posted: 27 Jan 2013, 0:41:07 UTC - in response to Message 74985.  

HERE is another error message:

BOINC:: Error reading and gzipping output datafile: default.out

NO credits for the unit!!!

https://boinc.bakerlab.org/rosetta/result.php?resultid=558345592

Doesn't appear to be MY pc's problem as gzip is a part of the Rosetta program itself!!


A typical error message if some earlier error keeps file default.out from being created. Were there any such earlier errors for that workunit?
ID: 74993 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ETQuestor

Send message
Joined: 13 Nov 12
Posts: 8
Credit: 957,206
RAC: 0
Message 74996 - Posted: 27 Jan 2013, 2:27:13 UTC - in response to Message 74980.  

I will upload my task debug logs as soon as I get another task or two to complete.

For those interested in how to do this, you edit (or create, if it does not already exist) the file cc_config.xml in your main BOINC directory. Mine looks like this:

<cc_config>
<log_flags>
<task_debug>1</task_debug>
</log_flags>
</cc_config>
ID: 74996 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile JAMES DORISIO

Send message
Joined: 25 Dec 05
Posts: 15
Credit: 187,965,607
RAC: 91,604
Message 74997 - Posted: 27 Jan 2013, 16:05:28 UTC

Intel I7-3770, Ubuntu linux 12.04 amd64 ,nvidia driver 310.14, Boinc 7.0.27

task outcome client error, state invalid
https://boinc.bakerlab.org/rosetta/result.php?resultid=558662180

Boinc log less checkpointing.

Sat 26 Jan 2013 09:23:40 PM EST | rosetta@home | [task] ACTIVE_TASK::start(): forked process: pid 9248
Sat 26 Jan 2013 09:23:40 PM EST | rosetta@home | [task] task_state=EXECUTING for rb_01_26_36263_68951__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_72986_286_0 from start
Sat 26 Jan 2013 09:23:40 PM EST | rosetta@home | Starting task rb_01_26_36263_68951__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_72986_286_0 using minirosetta version 345 in slot 2
Sun 27 Jan 2013 04:20:49 AM EST | rosetta@home | [task] Process for rb_01_26_36263_68951__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_72986_286_0 exited, status 0, task state 1
Sun 27 Jan 2013 04:20:49 AM EST | rosetta@home | [task] process exited with status 0
Sun 27 Jan 2013 04:20:49 AM EST | rosetta@home | [task] task_state=EXITED for rb_01_26_36263_68951__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_72986_286_0 from handle_exited_app
Sun 27 Jan 2013 04:20:49 AM EST | rosetta@home | Computation for task rb_01_26_36263_68951__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_72986_286_0 finished
Sun 27 Jan 2013 04:20:49 AM EST | rosetta@home | [task] result state=FILES_UPLOADING for rb_01_26_36263_68951__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_72986_286_0 from CS::app_finished
Sun 27 Jan 2013 04:20:57 AM EST | rosetta@home | [task] result state=FILES_UPLOADED for rb_01_26_36263_68951__t000__1_C1_SAVE_ALL_OUT_IGNORE_THE_REST_72986_286_0 from CS::update_results
Sun 27 Jan 2013 04:20:57 AM EST | rosetta@home | Sending scheduler request: To report completed tasks.
Sun 27 Jan 2013 04:20:57 AM EST | rosetta@home | Reporting 1 completed tasks, not requesting new tasks

Jim



ID: 74997 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ETQuestor

Send message
Joined: 13 Nov 12
Posts: 8
Credit: 957,206
RAC: 0
Message 74998 - Posted: 27 Jan 2013, 16:42:55 UTC - in response to Message 74997.  

AMD Athlon 64 X2 Dual Core Processor 6000+ [Family 15 Model 67 Stepping 3], Fedora 18 x86_64, NVIDIA driver 310.19, Boinc 7.0.36

host: https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=1578078

task outcome client error, state invalid
https://boinc.bakerlab.org/rosetta/result.php?resultid=557755895

Boinc log less checkpointing.

25-Jan-2013 21:56:36 [rosetta@home] Starting task rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 using minirosetta version 345 in slot 0
26-Jan-2013 18:17:20 [rosetta@home] Restarting task rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 using minirosetta version 345 in slot 0
26-Jan-2013 18:23:54 [rosetta@home] [task] ACTIVE_TASK::start(): forked process: pid 2268
26-Jan-2013 18:23:54 [rosetta@home] [task] task_state=EXECUTING for rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 from start
26-Jan-2013 18:23:54 [rosetta@home] Restarting task rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 using minirosetta version 345 in slot 0
26-Jan-2013 22:34:42 [rosetta@home] [task] Process for rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 exited, status 0, task state 1
26-Jan-2013 22:34:42 [rosetta@home] [task] process exited with status 0
26-Jan-2013 22:34:42 [rosetta@home] [task] task_state=EXITED for rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 from handle_exited_app
26-Jan-2013 22:34:42 [rosetta@home] Computation for task rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 finished
26-Jan-2013 22:34:42 [rosetta@home] [task] result state=FILES_UPLOADING for rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 from CS::app_finished
26-Jan-2013 22:34:42 [rosetta@home] [task] ACTIVE_TASK::start(): forked process: pid 2525
26-Jan-2013 22:34:42 [rosetta@home] [task] task_state=EXECUTING for MB101_t000___robetta_IGNORE_THE_REST_03_09_72239_4759_0 from start
26-Jan-2013 22:34:42 [rosetta@home] Starting task MB101_t000___robetta_IGNORE_THE_REST_03_09_72239_4759_0 using minirosetta version 345 in slot 0
26-Jan-2013 22:34:45 [rosetta@home] Started upload of rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0_0
26-Jan-2013 22:34:51 [rosetta@home] Finished upload of rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0_0
26-Jan-2013 22:34:51 [rosetta@home] [task] result state=FILES_UPLOADED for rb_01_21_36039_68329_h001__if_IGNORE_THE_REST_08_03_72779_4_0 from CS::update_results
27-Jan-2013 07:50:01 [rosetta@home] update requested by user
27-Jan-2013 07:50:11 [rosetta@home] Sending scheduler request: Requested by user.
27-Jan-2013 07:50:11 [rosetta@home] Reporting 1 completed tasks
27-Jan-2013 07:50:11 [rosetta@home] Requesting new tasks for NVIDIA
27-Jan-2013 07:50:14 [rosetta@home] Scheduler request completed: got 0 new tasks
ID: 74998 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bruno

Send message
Joined: 12 Jul 10
Posts: 1
Credit: 830,470
RAC: 729
Message 75000 - Posted: 27 Jan 2013, 21:07:11 UTC

Hi

I seem to have the same problems as the other users in this thread:
https://boinc.bakerlab.org/rosetta/results.php?userid=386219

My system:
Intel(R) Core(TM) i7-3610QM CPU @ 2.30GHz
Microsoft Windows 7 Professional x64 Edition, Service Pack 1
NVIDIA GeForce driver

Best Regards

Bruno
ID: 75000 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile JAMES DORISIO

Send message
Joined: 25 Dec 05
Posts: 15
Credit: 187,965,607
RAC: 91,604
Message 75002 - Posted: 28 Jan 2013, 0:48:39 UTC
Last modified: 28 Jan 2013, 0:51:50 UTC

Log number 2 for this computer

Intel I7-3770, Ubuntu linux 12.04 amd64 ,nvidia driver 310.14, Boinc 7.0.27
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=1579123

task outcome client error, state invalid
https://boinc.bakerlab.org/rosetta/result.php?resultid=558798870

Boinc log less checkpointing.

Sun 27 Jan 2013 12:33:42 PM EST | rosetta@home | [task] ACTIVE_TASK::start(): forked process: pid 10428
Sun 27 Jan 2013 12:33:42 PM EST | rosetta@home | [task] task_state=EXECUTING for P2_1_s2_f5_abinitio_design_y022_001_72082_825_0 from start
Sun 27 Jan 2013 12:33:42 PM EST | rosetta@home | Starting task P2_1_s2_f5_abinitio_design_y022_001_72082_825_0 using minirosetta version 345 in slot 7
Sun 27 Jan 2013 07:33:06 PM EST | rosetta@home | [task] Process for P2_1_s2_f5_abinitio_design_y022_001_72082_825_0 exited, status 0, task state 1
Sun 27 Jan 2013 07:33:06 PM EST | rosetta@home | [task] process exited with status 0
Sun 27 Jan 2013 07:33:06 PM EST | rosetta@home | [task] task_state=EXITED for P2_1_s2_f5_abinitio_design_y022_001_72082_825_0 from handle_exited_app
Sun 27 Jan 2013 07:33:06 PM EST | rosetta@home | Computation for task P2_1_s2_f5_abinitio_design_y022_001_72082_825_0 finished
Sun 27 Jan 2013 07:33:06 PM EST | rosetta@home | [task] result state=FILES_UPLOADING for P2_1_s2_f5_abinitio_design_y022_001_72082_825_0 from CS::app_finished
Sun 27 Jan 2013 07:33:11 PM EST | rosetta@home | [task] result state=FILES_UPLOADED for P2_1_s2_f5_abinitio_design_y022_001_72082_825_0 from CS::update_results
Sun 27 Jan 2013 07:33:11 PM EST | rosetta@home | Sending scheduler request: To report completed tasks.
Sun 27 Jan 2013 07:33:11 PM EST | rosetta@home | Reporting 1 completed tasks, not requesting new tasks

Jim
ID: 75002 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,285
RAC: 12,464
Message 75004 - Posted: 28 Jan 2013, 12:16:08 UTC

Have you guys with Nvidia gpu's tried down grading your driver to version 306.97 and seeing if that helps? It has in some cases, not all but some. MOST of us with AMD cards are not seeing the problem anymore, I am up to Boinc version 7.0.45 and am still getting the usual credits for the units crunched.
ID: 75004 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Rayburner

Send message
Joined: 4 Oct 05
Posts: 32
Credit: 16,518,823
RAC: 0
Message 75007 - Posted: 28 Jan 2013, 13:18:22 UTC - in response to Message 75004.  

Hi mikey

I have this problem from the very beginning with this box (December 2011). So this problem is much older that driver version 306.97.

Every other project (RAPLPH also!!) runs just fine on this computer.

BTW I also switched to BOINC 7.0.45

Regards,
Rayburner


Have you guys with Nvidia gpu's tried down grading your driver to version 306.97 and seeing if that helps? It has in some cases, not all but some. MOST of us with AMD cards are not seeing the problem anymore, I am up to Boinc version 7.0.45 and am still getting the usual credits for the units crunched.


ID: 75007 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,285
RAC: 12,464
Message 75009 - Posted: 28 Jan 2013, 15:00:40 UTC - in response to Message 75007.  
Last modified: 28 Jan 2013, 15:02:49 UTC

Hi mikey

I have this problem from the very beginning with this box (December 2011). So this problem is much older that driver version 306.97.

Every other project (RAPLPH also!!) runs just fine on this computer.

BTW I also switched to BOINC 7.0.45

Regards,
Rayburner


Have you guys with Nvidia gpu's tried down grading your driver to version 306.97 and seeing if that helps? It has in some cases, not all but some. MOST of us with AMD cards are not seeing the problem anymore, I am up to Boinc version 7.0.45 and am still getting the usual credits for the units crunched.



I JUST dumped Rosetta off of one of my pc's as it REFUSED to send new work to my pc, Boinc kept saying "Not requesting tasks" even though I had NO cpu tasks on this 6 core pc! I am now crunching for Poem on that pc and it got 50 or more cpu units with NO problem!! There are WAAAY too many fish in the sea to waste time on one project that is being a PITA!!!!
ID: 75009 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Polian
Avatar

Send message
Joined: 21 Sep 05
Posts: 152
Credit: 10,141,266
RAC: 0
Message 75010 - Posted: 28 Jan 2013, 18:02:17 UTC - in response to Message 75009.  


I JUST dumped Rosetta off of one of my pc's as it REFUSED to send new work to my pc, Boinc kept saying "Not requesting tasks" even though I had NO cpu tasks on this 6 core pc! I am now crunching for Poem on that pc and it got 50 or more cpu units with NO problem!! There are WAAAY too many fish in the sea to waste time on one project that is being a PITA!!!!


Sounds more like a core client problem (bugs? in BOINC? say it's not so) than a scheduler issue.
ID: 75010 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
mikey
Avatar

Send message
Joined: 5 Jan 06
Posts: 1894
Credit: 8,767,285
RAC: 12,464
Message 75011 - Posted: 28 Jan 2013, 18:41:47 UTC - in response to Message 75010.  


I JUST dumped Rosetta off of one of my pc's as it REFUSED to send new work to my pc, Boinc kept saying "Not requesting tasks" even though I had NO cpu tasks on this 6 core pc! I am now crunching for Poem on that pc and it got 50 or more cpu units with NO problem!! There are WAAAY too many fish in the sea to waste time on one project that is being a PITA!!!!


Sounds more like a core client problem (bugs? in BOINC? say it's not so) than a scheduler issue.


I don't know I even reset the project and every other pc is working just fine on Rosetta, that one just didn't! It's okay Poem loves my time!
ID: 75011 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 75012 - Posted: 29 Jan 2013, 0:08:30 UTC

Thanks for the information!

Unfortunately I can't see anything in the logs that might suggest what is causing the issue. I'm planning to update the server scheduler to use the version that Ralph uses. Hopefully this will fix things as users suggest the Ralph server is okay and does not have this issue.

thanks again everyone.
ID: 75012 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile JAMES DORISIO

Send message
Joined: 25 Dec 05
Posts: 15
Credit: 187,965,607
RAC: 91,604
Message 75013 - Posted: 29 Jan 2013, 0:30:56 UTC

David
Please let us know when this is done. I would like to bring some computers back here if this works.
Thanks Jim
ID: 75013 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ETQuestor

Send message
Joined: 13 Nov 12
Posts: 8
Credit: 957,206
RAC: 0
Message 75016 - Posted: 29 Jan 2013, 23:21:25 UTC - in response to Message 75012.  

...I'm planning to update the server scheduler to use the version that Ralph uses...


David,

This is an interesting idea. My rosetta worked fine for months until between 7-10 days ago (I think). I assumed it was the upgrade to my NVIDIA driver, but I put it back and it didn't help. Did you make any changes to the scheduler code in that timeframe that might explain it "breaking" for me again?

Also, any ETA on when that change would happen? I'm happy to test or help in any way that I can.
ID: 75016 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 75017 - Posted: 30 Jan 2013, 0:45:53 UTC

We haven't changed the scheduler in a long time so it's likely the driver update that broke things.

Can anyone confirm that ralph does not have this issue?

I'll ask people here to submit more test jobs to Ralph.

I don't know when I'll be able to update the server but hopefully next month.

thanks!
ID: 75017 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Pushkin
Avatar

Send message
Joined: 10 Mar 07
Posts: 14
Credit: 7,068,050
RAC: 0
Message 75019 - Posted: 30 Jan 2013, 6:31:36 UTC

Hi guys,
I have been redirected to this thread by Polian, since I have the same issue like you on my Debian Wheezy machine with BOINC 7.0.27. My configuration includes both problematic hardware components - Intel(R) Core(TM) i5-3570 CPU @ 3.40GHz and nVidia GeForce GT630 graphic card.

As wrote Rayburner - this issue is older than nVidia 306.97 drivers. I am currently running nVidia 304.48 drivers and they cause this problem too.

Can I help anything more than just running Rosetta in debug mode? (<task_debug>1</task_debug>)

Greetings,
Pushkin
ID: 75019 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile dcdc

Send message
Joined: 3 Nov 05
Posts: 1829
Credit: 115,561,500
RAC: 59,108
Message 75020 - Posted: 30 Jan 2013, 12:11:57 UTC - in response to Message 75019.  
Last modified: 30 Jan 2013, 12:12:25 UTC

Hi guys,
I have been redirected to this thread by Polian, since I have the same issue like you on my Debian Wheezy machine with BOINC 7.0.27. My configuration includes both problematic hardware components - Intel(R) Core(TM) i5-3570 CPU @ 3.40GHz and nVidia GeForce GT630 graphic card.

As wrote Rayburner - this issue is older than nVidia 306.97 drivers. I am currently running nVidia 304.48 drivers and they cause this problem too.

Can I help anything more than just running Rosetta in debug mode? (<task_debug>1</task_debug>)

Greetings,
Pushkin

Are you able to run a job or two with the nVidia card removed?
ID: 75020 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Pushkin
Avatar

Send message
Joined: 10 Mar 07
Posts: 14
Credit: 7,068,050
RAC: 0
Message 75021 - Posted: 30 Jan 2013, 19:15:01 UTC - in response to Message 75020.  

Hi guys,
I have been redirected to this thread by Polian, since I have the same issue like you on my Debian Wheezy machine with BOINC 7.0.27. My configuration includes both problematic hardware components - Intel(R) Core(TM) i5-3570 CPU @ 3.40GHz and nVidia GeForce GT630 graphic card.

As wrote Rayburner - this issue is older than nVidia 306.97 drivers. I am currently running nVidia 304.48 drivers and they cause this problem too.

Can I help anything more than just running Rosetta in debug mode? (<task_debug>1</task_debug>)

Greetings,
Pushkin

Are you able to run a job or two with the nVidia card removed?


Unfortunately I can't, since it is not my own computer. But you gave me an idea - I may try to run Rosetta in a session without X running, or I can try running it with opensource drivers for nVidia. I'll let you know next week.

Currently I have tried to run Rosetta with disabled compositing ... without effect, again client error (see here).
ID: 75021 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
ETQuestor

Send message
Joined: 13 Nov 12
Posts: 8
Credit: 957,206
RAC: 0
Message 75027 - Posted: 1 Feb 2013, 19:25:16 UTC - in response to Message 75017.  



Can anyone confirm that ralph does not have this issue?

I'll ask people here to submit more test jobs to Ralph.




I've signed up for Ralph with one of my trouble machines, but there don't seem to be any tasks available. As soon as I get any, I'll report the results.
ID: 75027 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ananas

Send message
Joined: 1 Jan 06
Posts: 232
Credit: 752,471
RAC: 0
Message 75036 - Posted: 4 Feb 2013, 9:10:51 UTC
Last modified: 4 Feb 2013, 9:13:35 UTC

There might be something with the project options (not properly initialized?).

If the results fail again, you could try to set the value "Target CPU run time" on the project options

p.s.: got this idea comparing the output of valid vs. invalid result, the startup output looks different in some "options" output lines.
ID: 75036 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 . . . 7 · Next

Message boards : Number crunching : Client errors



©2024 University of Washington
https://www.bakerlab.org