Message boards : Number crunching : Problems with Rosetta version 5.98
Author | Message |
---|---|
David E K Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 1 Jul 05 Posts: 1018 Credit: 4,334,829 RAC: 0 |
Please post bugs/issues regarding version 5.98 here. |
Mike Francis Send message Joined: 24 Nov 05 Posts: 8 Credit: 623,519 RAC: 0 |
6/26/2008 3:31:54 PM|rosetta@home|Starting t434_1_NMRREF_1_t434_1_T0434_2QPWA_2JV0_hybridIGNORE_THE_REST_truncated_4104_1_1 6/26/2008 3:31:54 PM|rosetta@home|Starting task t434_1_NMRREF_1_t434_1_T0434_2QPWA_2JV0_hybridIGNORE_THE_REST_truncated_4104_1_1 using rosetta_beta version 598 6/26/2008 4:16:56 PM|rosetta@home|Computation for task t434_1_NMRREF_1_t434_1_T0434_2QPWA_2JV0_hybridIGNORE_THE_REST_truncated_4104_1_1 finished 6/26/2008 4:16:56 PM|rosetta@home|Output file t434_1_NMRREF_1_t434_1_T0434_2QPWA_2JV0_hybridIGNORE_THE_REST_truncated_4104_1_1_0 for task t434_1_NMRREF_1_t434_1_T0434_2QPWA_2JV0_hybridIGNORE_THE_REST_truncated_4104_1_1 absent |
[KWSN]John Galt 007 Send message Joined: 4 Aug 06 Posts: 6 Credit: 1,017,647 RAC: 0 |
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=158620863 https://boinc.bakerlab.org/rosetta/workunit.php?wuid=158616469 https://boinc.bakerlab.org/rosetta/workunit.php?wuid=158605953 All with compute errors in the first minute. |
Adam Send message Joined: 26 Jun 07 Posts: 7 Credit: 487,917 RAC: 0 |
Compute error, https://boinc.bakerlab.org/rosetta/result.php?resultid=173774049 |
P . P . L . Send message Joined: 20 Aug 06 Posts: 581 Credit: 4,865,274 RAC: 0 |
This one fell over on both hosts, same error. https://boinc.bakerlab.org/rosetta/workunit.php?wuid=158612212 Output file FRA_t449_CASP8_MANUAL_1_IGNORE_THE_RESTt449_1_ttxxxxT0449_1CHIM_0001_0001_0001_4126_3627_1_0 for task absent <core_client_version>5.10.30</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1) </message> <stderr_txt> # cpu_run_time_pref: 21600 # random seed: 2404847 ERROR:: Exit from: .loop_relax.cc line: 1745 </stderr_txt> pete. |
RC Send message Joined: 27 Sep 05 Posts: 13 Credit: 262,048 RAC: 0 |
Another compute error, https://boinc.bakerlab.org/rosetta/result.php?resultid=173797309 |
anti-cancers Send message Joined: 2 Sep 06 Posts: 9 Credit: 173,262 RAC: 0 |
|
Snags Send message Joined: 22 Feb 07 Posts: 198 Credit: 2,888,320 RAC: 0 |
This one fell over on both hosts, same error. same here |
Konstantin Iliev Send message Joined: 22 May 06 Posts: 4 Credit: 2,205,841 RAC: 0 |
Again errors as 5.96 :( https://boinc.bakerlab.org/rosetta/result.php?resultid=173787198 https://boinc.bakerlab.org/rosetta/result.php?resultid=173807571 https://boinc.bakerlab.org/rosetta/result.php?resultid=173821223 |
adrianxw Send message Joined: 18 Sep 05 Posts: 653 Credit: 11,840,739 RAC: 28 |
|
Wonderwall Send message Joined: 19 Mar 07 Posts: 1 Credit: 39,192 RAC: 0 |
Please post bugs/issues regarding version 5.98 here. rosetta@home Rosetta Beta 5.96 1405_CaspB_IUMPAB_Type2_RES81to19... 03:51:35 00.000% ...06/28/... Running high prio... |
AMD_is_logical Send message Joined: 20 Dec 05 Posts: 299 Credit: 31,460,681 RAC: 0 |
This WU did 1247 decoys, then was marked "invalid" for no apparent reason: FRA_t449_CASP8_AUTO_1SNZ_1L7J_2CIQ_1_IGNORE_THE_RESTt449_1_ttttaaT0449_1L7JA_10_0001_0001_0002_4134_634 |
Feet1st Send message Joined: 30 Dec 05 Posts: 1755 Credit: 4,690,520 RAC: 0 |
This little bugger has been running all weekend. 47hrs on a 24hr preference. FRA_t449_CASP8_MANUAL_1_IGNORE_THE_RESTt449_1_ttxxxxT0449_1CHIM_0001_0001_0001_4142_1913 Yet it is still getting CPU time, and the step number is still incrementing. It says it is on model 151. Add this signature to your EMail: Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might! https://boinc.bakerlab.org/rosetta/ |
sslickerson Send message Joined: 14 Oct 05 Posts: 101 Credit: 578,497 RAC: 0 |
Here is a very fast error running version 5.98 on Windows XP: 174404220. It looks like it failed on at least one other host in the same manner. |
BrnmccO1 Send message Joined: 26 Jun 07 Posts: 17 Credit: 578,825 RAC: 0 |
Bizzare problem with this WU; had an 'unhandled exception error' after about approx 50 mins CPU run time, with a lenthy Std_Out: 157316144 <core_client_version>5.10.45</core_client_version> <![CDATA[ <message> - exit code -1073741819 (0xc0000005) </message> <stderr_txt> # cpu_run_time_pref: 10800 # random seed: 2747207 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x00B3C947 read attempt to address 0x000000A4 Engaging BOINC Windows Runtime Debugger... Otherwise no other errors so far with 5.98 on both of my hosts (knocks on wood ;p) |
TeAm Enterprise Send message Joined: 28 Sep 05 Posts: 18 Credit: 27,911,183 RAC: 203 |
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=158648907 Two validate errors after full crunch. Rosetta needs to think about how to apply credit when the problems are obviously of project/WU source. Jim Crunch with friends - TeAm Anandtech |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,677,569 RAC: 10,479 |
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=158648907 Credit is applied to these as claimed - it doesn't show on the task's main page but does if you hit the Task ID link on the left. HTH Danny |
Virtual Boss* Send message Joined: 10 May 08 Posts: 35 Credit: 713,981 RAC: 0 |
WU FRA_t449_CASP8_MANUAL_1_IGNORE_THE_RESTt449_1_ttxxxxT0449_1CHIM_0001_0001_0001_4142_3294_1 using rosetta_beta version 598 Original estimated run time about 6 CPU Hrs Still Runing at 10:10:00 CPU Progress 98.386% and incrementing 0.001 about every 25 CPU secs To Completion 00:09:55 (no change last 30 CPU minutes At current % increase will take another 11+ CPU Hrs, or if Prog% is calculated from time done as % of Time done+To completion then will run forever. BTW Currently Model 22 Step 47795 |
dcdc Send message Joined: 3 Nov 05 Posts: 1832 Credit: 119,677,569 RAC: 10,479 |
WU FRA_t449_CASP8_MANUAL_1_IGNORE_THE_RESTt449_1_ttxxxxT0449_1CHIM_0001_0001_0001_4142_3294_1 using rosetta_beta version 598 the % complete and time to completion aren't linear - they're estimates, so don't worry about them if Rosetta's CPU time is increasing in task manager. Danny |
Feet1st Send message Joined: 30 Dec 05 Posts: 1755 Credit: 4,690,520 RAC: 0 |
Danny is correct about the time estimates. But the t449 I reported earlier never did finish. I let it run for 68 hours before aborting it (my runtime preference is 24hrs so I'm sure the watchdog would have discovered it after 4x that preference, but I didn't want to waste the time). My aborted task didn't seem to send the normal data in to the server. 150 presumably good models lost. So, I would suggest (if you have the patience) to exit and restart BOINC 5 times. Each time leaving it run for long enough to get itself initialized and running the problem task. Rosetta will then detect no progress after 4 or 5 restarts and more cleanly cut it off and send it in. I'm also wishing I had saved a copy of all the slots directories. Again, if you have the time, after your first exit of BOINC, I would save all the directories under your BOINC installation path with /slots on the end of the path, and EMail it to the rosettamod. Add this signature to your EMail: Running Microsoft's "System Idle Process" will never help cure cancer, AIDS nor Alzheimer's. But running Rosetta@home just might! https://boinc.bakerlab.org/rosetta/ |
Message boards :
Number crunching :
Problems with Rosetta version 5.98
©2024 University of Washington
https://www.bakerlab.org