Message boards : Number crunching : Rosetta 4.1+ and 4.2+
Author | Message |
---|---|
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,228,659 RAC: 8,784 |
All MOF_ wus: Glad you said that. I've been tweaking my new PC very slowly. I had numerous errors at one tweak level, dialled it back, and seem to have found a sweet spot, apart from two task errors. Both are MOF tasks and show the same error as yours: MOF_I213_12res_testasym_c.82.1_0001_I_21_3_hit_ASP_ASP_2_3_46_cell035_ncontact25_score-37_SAVE_ALL_OUT_1056212_310_0 and MOF_P4132_12res_testasym_c.10.2_0001_P_41_3_2_hit_ASP_ASP_1_3_153_cell037_ncontact12_score000_SAVE_ALL_OUT_1055733_167_0 |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2125 Credit: 41,228,659 RAC: 8,784 |
All MOF_ wus: Clarification: it's not all MOF tasks for me. I have two running now that are checkpointing fine and are currently over 5 hours through their 8 hours. |
Kissagogo27 Send message Joined: 31 Mar 20 Posts: 86 Credit: 2,919,932 RAC: 2,098 |
https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1179951356 My computer has only 4 GB of RAM, and the MOF WU errored within a few seconds. The wingman's computer has 128 GB of RAM and the MOF WU finished fine ^^ CPU type: AuthenticAMD
I don't believe this: Peak working set size 324.82 MB. And then other anomalies ...
for an AMD EPYC 7452
for a Core 2 Duo E7600, same GFLOPS ... curious ... |
Brian Nixon Send message Joined: 12 Apr 20 Posts: 293 Credit: 8,432,366 RAC: 0 |
same GFLOPS ... curious ... The 2.78 GFLOPS is the nominal rate at which the application performs work, not a measure of the performance of any individual machine running it. It’s the mechanism by which Rosetta fits its ‘fixed duration / variable work’ approach into BOINC’s original expectation of ‘fixed work / variable duration’. (Every task is declared as having 80 000 GFLOPs of work to perform, so with an application that is declared as achieving 2.78 GFLOPs per second, the initial run-time estimate becomes 8 hours.) |
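The arithmetic behind that 8-hour figure can be sketched in a few lines of Python. The two constants are the numbers quoted in the post above; the function name is made up for illustration and is not part of BOINC or Rosetta.

```python
# Numbers quoted in the post above.
DECLARED_WORK_GFLOP = 80_000   # work declared per task, in GFLOPs (operations)
NOMINAL_RATE_GFLOPS = 2.78     # declared application speed, in GFLOPs per second


def initial_runtime_estimate_hours(work_gflop: float, rate_gflops: float) -> float:
    """Hypothetical helper: declared work divided by declared speed, in hours."""
    seconds = work_gflop / rate_gflops
    return seconds / 3600.0


hours = initial_runtime_estimate_hours(DECLARED_WORK_GFLOP, NOMINAL_RATE_GFLOPS)
print(f"Initial estimate: {hours:.2f} hours")
```

The result comes out a shade under 8 hours, and it is the same for every host because neither number depends on the machine running the task.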
Brian Nixon Send message Joined: 12 Apr 20 Posts: 293 Credit: 8,432,366 RAC: 0 |
my computer have only 4GB of Ram, and the Mof Wu errored in few seconds .. I’m not sure memory is the key to it. Here’s one which failed on both my machine and a 128 GB machine. All the failures I’ve seen so far are on Windows. Some work units have failed on my machines but succeeded on Android or macOS. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 18,215 |
All the failures I’ve seen so far are on Windows. Some work units have failed on my machines but succeeded on Android or macOS. Add Linux applications to that. I've got heaps of RAM on my Windows systems but Tasks that crashed and burnt on mine completed OK on some Linux systems (one or 2 also errored out on the Linux application, but most ran to completion OK). Grant Darwin NT |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 7,594 |
I've got heaps of RAM on my Windows systems but Tasks that crashed and burnt on mine completed OK on some Linux systems (one or 2 also errored out on the Linux application, but most ran to completion OK). Yep. It seems to be a Windows app problem. Also today: 1319285270, 1319282688 - Unhandled Exception Record - |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 7,594 |
Now there are a lot of errors on "hHH000001_dummy" WUs: -1073741819 (0xC0000005) STATUS_ACCESS_VIOLATION |
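The two numbers in that error are the same 32-bit value written two ways: -1073741819 is the exit code read as a signed integer, and 0xC0000005 is the same bits read as unsigned hexadecimal. A small Python sketch of the conversion (the helper name is only for illustration):

```python
def exit_code_to_hex(signed_code: int) -> str:
    """Reinterpret a signed 32-bit exit code as unsigned and format it as hex."""
    unsigned = signed_code & 0xFFFFFFFF  # keep only the low 32 bits
    return f"0x{unsigned:08X}"


print(exit_code_to_hex(-1073741819))  # 0xC0000005
```

This makes it easier to match the signed codes shown in task listings against the hexadecimal status names.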
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 18,215 |
Now a lot of errors of "hHH000001_dummy" wus I've had a couple of Tasks Validate, but that's about all so far. Errors outnumber Valids by a huge margin. Grant Darwin NT |
Kissagogo27 Send message Joined: 31 Mar 20 Posts: 86 Credit: 2,919,932 RAC: 2,098 |
Hi, a new sort of error for me with the "FF_gogogo_0_SAVE_ALL_OUT_IGNORE_THE_REST_6lp4ci5n_1056733_2_0" WU. Stderr.txt has a lot of: warning: filename too long--truncating. and [ m_m_m_JHR_b2_03207_n_full_17_000000231_0000100002_0000001_0_89_103_H_._JHR_b2_00801_n_full_17_0001_0001_0000200002_0000001_0_0001_full_85_100_H_._JHR_b2_00430_n_full_17_0000100004_0000031_0_0001_0001_0000200008_0000001_1_0001_7_14_H_._JHR_bd4_00177_nS_0022_00 ] |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 4,484 |
I’m going to sulk. Why can’t I have some of these errors? I never get any errors! It’s not fair! I WANT MY SHARE OF ERRORS! |
Brian Nixon Send message Joined: 12 Apr 20 Posts: 293 Credit: 8,432,366 RAC: 0 |
Come to the light side… Use Windows… Learn to love those 25-year-old path length limitations… Enjoy the obscure bugs… ;-) |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 4,484 |
Come to the light side… Ug - I don’t want them THAT badly. I was setting up a Win10 machine for my granddaughter to use for on-line schooling and it was not a pleasant experience. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 7,594 |
I’m going to sulk, why can’t I have some of these errors - I never get any errors! it’s not fair! If I'm not wrong, the native app is for Linux and they compile it for Windows afterwards. Maybe this is the problem... |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 4,484 |
I’m going to sulk, why can’t I have some of these errors - I never get any errors! it’s not fair! Windows is the problem??? Yes, Windows is always a problem :-) More seriously, that should not be a problem if they have a good testbed and apply it to both versions. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 7,594 |
More seriously, that should not be a problem if they have a good testbed and apply it to both versions. They said, in a recent publication, that almost half of the Rosetta code is useless: By 2019, the RosettaCommons has grown to laboratories at 71 institutions worldwide, overseeing a project consisting of over 3 million lines of code with contributions from over 800 scientists ... we estimate that the codebase could be reduced by half without a significant loss of functionality. |
Phil McCrum Send message Joined: 17 Apr 10 Posts: 3 Credit: 244,691 RAC: 0 |
I am using BOINC 7.16.11 (x64) on Windows 10 Home. I am attempting to run four Rosetta 4.20 jobs. All four of them are counting up. I've exited and restarted BOINC. I've suspended a couple to see if the remaining two would straighten out. They are still counting up. Do I need to abort them, or am I just not able to run Rosetta 4.20 jobs? |
Brian Nixon Send message Joined: 12 Apr 20 Posts: 293 Credit: 8,432,366 RAC: 0 |
What do you mean by “counting up”? Are you saying the time in the ‘Remaining’ column is continually increasing even though the tasks are running? Note that the displayed remaining time and percentage progress are only rough estimates; all tasks should run for 8 hours of CPU time (give or take a few minutes). |
Phil McCrum Send message Joined: 17 Apr 10 Posts: 3 Credit: 244,691 RAC: 0 |
The tasks came in with an estimate of a little over 7 hours. I ran them for well over an hour and the estimate remaining was still a little over 7 hours. Am I just being too impatient? |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 4,484 |
The tasks came in with an estimate of a little over 7 hours. I ran them for well over an hour and the estimate remaining was still a little over 7 hours. Am I just being too impatient? Yes, especially if Rosetta is a new project for this machine. It takes a few days for the system to settle down and get used to the environment before the estimated times can be relied on. |
©2024 University of Washington
https://www.bakerlab.org