Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 189 · 190 · 191 · 192 · 193 · 194 · 195 . . . 300 · Next

AuthorMessage
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,717,270
RAC: 11,974
Message 105493 - Posted: 17 Mar 2022, 2:57:42 UTC - in response to Message 105490.  

I've seen that somewhere before (perhaps not this project), but they don't seem to be punishing me yet, I've gone through several thousand bad Preetham 4.2 tasks.
Spoke too soon. 2 of 7 computers are now in the stocks. Oh well, I have some RB and Python to do.
ID: 105493 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1232
Credit: 14,269,631
RAC: 2,588
Message 105494 - Posted: 17 Mar 2022, 4:03:20 UTC

Sine BOINC projects reverse any penalty for failed tasks if all of the other computers given the same task fail it also.
ID: 105494 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tgbauer

Send message
Joined: 5 Jan 06
Posts: 10
Credit: 100,814,509
RAC: 74,308
Message 105495 - Posted: 17 Mar 2022, 4:17:42 UTC

Getting "Computation error" on all the most recent 1KB size tasks on MacOS and Ubuntu LTS
ID: 105495 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
tgbauer

Send message
Joined: 5 Jan 06
Posts: 10
Credit: 100,814,509
RAC: 74,308
Message 105496 - Posted: 17 Mar 2022, 4:31:55 UTC - in response to Message 105495.  

For example:
Thu 17 Mar 2022 12:30:34 AM EDT | Rosetta@home | Computation for task preetham_gen_13074_0001_0001_0_SAVE_ALL_OUT_2912781_849_1 finished
Thu 17 Mar 2022 12:30:34 AM EDT | Rosetta@home | Output file preetham_gen_13074_0001_0001_0_SAVE_ALL_OUT_2912781_849_1_r439706529_0 for task preetham_gen_13074_0001_0001_0_SAVE_ALL_OUT_2912781_849_1 absent

ID: 105496 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 187
Credit: 6,370,872
RAC: 5,700
Message 105501 - Posted: 17 Mar 2022, 11:28:12 UTC - in response to Message 105494.  

BOINC projects reverse any penalty for failed tasks if all of the other computers given the same task fail it also.

It seems I am still in the penalty box.


Application details for host 5910575
Rosetta 4.20 x86_64-pc-linux-gnu
Number of tasks completed 	2591
Max tasks per day 	1      <---<<<
Number of tasks today 	0
Consecutive valid tasks 	0
Average processing rate 	2.78 GFLOPS
Average turnaround time 	0.97 days

ID: 105501 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile jay

Send message
Joined: 12 Jan 08
Posts: 20
Credit: 195,801
RAC: 0
Message 105504 - Posted: 17 Mar 2022, 17:07:10 UTC

About the 'Residue' error...

I think its a good idea to re-post the complete error from a task...

<stderr_txt>
command: ../../projects/boinc.bakerlab.org_rosetta/rosetta_4.20_x86_64-pc-linux-gnu @preetham_gen_59129_0001_0001_0.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 3988120
Using database: database_357d5d93529_n_methyl/minirosetta_database

ERROR: Error in simple_cycpcp_predict app read_sequence() function!  The minimum number of residues for a cyclic peptide is 4.  (GenKIC requires three residues, plus a fourth to serve as an anchor).
ERROR:: Exit from: src/protocols/cyclic_peptide_predict/SimpleCycpepPredictApplication.cc line: 2264
BOINC:: Error reading and gzipping output datafile: default.out
20:44:11 (20850): called boinc_finish(1)

</stderr_txt>


I run Linux and my wing-man (on this and other WU) runs windows. Both get the same error.

I **assume** this is the same error that is on this forum for the last few days.

I have set ' no new work' for Rosetta and will check back on this forum to see if the code has been fixed...


reference: https://www.rosettacommons.org/docs/latest/scripting_documentation/RosettaScripts/composite_protocols/generalized_kic/GeneralizedKIC


Happy St. Patrick's Day!
Jay
ID: 105504 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Kyle Craig

Send message
Joined: 18 Dec 17
Posts: 2
Credit: 126,343
RAC: 0
Message 105505 - Posted: 17 Mar 2022, 23:13:40 UTC - in response to Message 105418.  

Virtualization is enabled in the bios. HyperV doesn't show up in windows features. There is Virtual Machine Platform and Windows Hypervisor Platform, but they are both disabled in windows. So I'm still not sure why I'm getting the hardware acceleration error.
ID: 105505 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 187
Credit: 6,370,872
RAC: 5,700
Message 105506 - Posted: 18 Mar 2022, 2:13:59 UTC

Hurray! I got a bunch of new tasks a few hours ago and six are currently running. and all have more than three hours on them instead of 40 seconds or less I have gotten lately.
ID: 105506 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 353
Credit: 1,227,479
RAC: 3,710
Message 105509 - Posted: 18 Mar 2022, 8:59:50 UTC - in response to Message 105506.  

Hurray! I got a bunch of new tasks a few hours ago and six are currently running. and all have more than three hours on them instead of 40 seconds or less I have gotten lately.


That's because you received tasks from the Robetta server. Which means those are not from the Rosetta@home/Baker lab team but from someone else.
(Robetta is a public server which allows researchers from around the world to submit jobs that will either run on Rosetta@home [if they run on Rosetta 4.20] or on the Baker Lab computational resources [if they run RoseTTAFold]).
ID: 105509 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 105510 - Posted: 18 Mar 2022, 9:34:49 UTC - in response to Message 105509.  

I've never really looked at hoe to tell the difference.

What's in the task title that shows it's robetta and not something from the lab?

And then the CASP stuff, or what's that institute that Dr. B. has founded? Where is their stuff? Or does that not come here?
ID: 105510 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mr P Hucker
Avatar

Send message
Joined: 12 Aug 06
Posts: 1600
Credit: 11,717,270
RAC: 11,974
Message 105511 - Posted: 18 Mar 2022, 9:37:46 UTC - in response to Message 105510.  

I've never really looked at hoe to tell the difference.

What's in the task title that shows it's robetta and not something from the lab?

And then the CASP stuff, or what's that institute that Dr. B. has founded? Where is their stuff? Or does that not come here?
For Robetta, the task name begins RB.
ID: 105511 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Falconet

Send message
Joined: 9 Mar 09
Posts: 353
Credit: 1,227,479
RAC: 3,710
Message 105512 - Posted: 18 Mar 2022, 10:12:56 UTC - in response to Message 105511.  

I've never really looked at hoe to tell the difference.

What's in the task title that shows it's robetta and not something from the lab?

And then the CASP stuff, or what's that institute that Dr. B. has founded? Where is their stuff? Or does that not come here?
For Robetta, the task name begins RB.



As for CASP, I believe those are submitted via Robettaunder the username "casp".
As for the Institute for Protein Design, I believe there was a post from an admin/scientist back in the early days of the pandemic in which it was said that Rosetta@home was being given more attention because of the massive increase in resources - in any case, if anyone from one of the other labs at the IPD needs to run something on Rosetta@home, it's probably fairly straightforward to do so.
ID: 105512 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1994
Credit: 9,524,535
RAC: 7,506
Message 105513 - Posted: 18 Mar 2022, 10:40:42 UTC - in response to Message 105512.  

As for CASP, I believe those are submitted via Robetta under the username "casp".


Here we can see the queue
ID: 105513 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 187
Credit: 6,370,872
RAC: 5,700
Message 105514 - Posted: 18 Mar 2022, 12:46:17 UTC - in response to Message 105509.  

Hurray! I got a bunch of new tasks a few hours ago and six are currently running. and all have more than three hours on them instead of 40 seconds or less I have gotten lately.

That's because you received tasks from the Robetta server. Which means those are not from the Rosetta@home/Baker lab team but from someone else.
(Robetta is a public server which allows researchers from around the world to submit jobs that will either run on Rosetta@home [if they run on Rosetta 4.20] or on the Baker Lab computational resources [if they run RoseTTAFold]).


That seems to be true. Mine are all rb_03_17_...

Does that mean that those using the Robetta server are more careful about what they offer we clients than those from the Rosetta@home/Baker lab team?
ID: 105514 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MStenholm

Send message
Joined: 18 Apr 20
Posts: 18
Credit: 25,821,080
RAC: 16,471
Message 105525 - Posted: 18 Mar 2022, 21:10:52 UTC - in response to Message 105514.  

rb jobs seems to be made from a wide variety of people. The memory required can reach 1.6 GB or the more normal 300-500k MB. The current rb_03_18 is easy on the memory and on the CPU and the result is that the points is less then half of rb_03_17. My normal wall 230W is 155W and CPU is max 55, not the normal 63C. 3900X, 480 mm good cooling. I’m tempted to run 1 hr test run for the future and then pass the easy jobs to others. Due the the lack of job I will let the remaining jobs run to completion but Rosetta is mix bag of fun at best recently. WCG will be back soon….
ID: 105525 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 105527 - Posted: 18 Mar 2022, 23:21:34 UTC - in response to Message 105525.  

If you have virtualization and want to test your system, enable Python tasks, combine that with LHC ATLAS and your system will get a good work out.
ID: 105527 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MStenholm

Send message
Joined: 18 Apr 20
Posts: 18
Credit: 25,821,080
RAC: 16,471
Message 105530 - Posted: 19 Mar 2022, 2:25:42 UTC - in response to Message 105527.  

If you have virtualization and want to test your system, enable Python tasks, combine that with LHC ATLAS and your system will get a good work out.


After reading yours and others problems with Python my 16 GB machine and my problem with programmers that give a f..k about our equipment that would never happen. I like that I can run decent timings on my memory and there is no way that I will invest $500 in RAM for the sole purpose to bring my SSD to a premature death.
ID: 105530 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
MStenholm

Send message
Joined: 18 Apr 20
Posts: 18
Credit: 25,821,080
RAC: 16,471
Message 105531 - Posted: 19 Mar 2022, 2:25:45 UTC - in response to Message 105527.  
Last modified: 19 Mar 2022, 2:26:52 UTC

Empty
ID: 105531 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 105533 - Posted: 19 Mar 2022, 9:15:07 UTC - in response to Message 105530.  

If you have virtualization and want to test your system, enable Python tasks, combine that with LHC ATLAS and your system will get a good work out.


After reading yours and others problems with Python my 16 GB machine and my problem with programmers that give a f..k about our equipment that would never happen. I like that I can run decent timings on my memory and there is no way that I will invest $500 in RAM for the sole purpose to bring my SSD to a premature death.



There were teething problems with Python. We got those sorted out.
The SSD premature death is hype in my opinion.
I've been running BOINC with various projects that write all the time to one of my SSD's for the last 5 years and it is still in good health. Samsung SSD's are very rugged. I am just now at 65.1 TB of writes. According to web information, I still have at least another 60TB to go before I would have to replace this drive. So that is about 10 years of writes. But it could go longer since it is over provisioned and has some unused cells.
as for 500 in RAM, I spent about 150 to upgrade my ram to handle 15 Pythons at a one time if it came to that.

I have put a bunch of money into this machine over the years. It's not the fastest out there, but it is a good build for what I want to participate in. It's cost a fair bit of money over the years, but this is the final build.
48GB RAM, 16 core, 2 SSD (1 dedicated BOINC data, 1 Windows and was formerly BOINC as well). 1HDD for storage. One of the top liquid cooling systems on the market. 550 Watt digital power supply. I run 2 GPU (until WCG comes back) and 4 CPU projects, plus FAH. That's a fair amount of data being transferred, but yet my drive hasn't given up.

LHC ATLAS is a bit of a memory hog, but it's much more stable than Python and much more mature.
The group has a lot of Gurus in it, including one from here that also is on other similar projects to my selection.

I can't tell you why RAH group all of sudden disappeared from here. I guess their funding does not allow for a person to come monitor here. That or they say they will just see it in the results. But the lack of response from their office or from a member of staff that is the tech person is not right. They seem to be isolated.

BTW, have you looked at SIdock or TN-Grid or Quchempedia?
Quchem is a Vbox project. But I had troubles keeping their work stable with all my other projects.
I run SIdock and TN-Grid as well and they are both standard CPU projects and very rock solid and don't require that much memory.
ID: 105533 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 0
Message 105534 - Posted: 19 Mar 2022, 9:16:28 UTC - in response to Message 105531.  

Empty



Next time, when you edit, just erase everything, type in two space bars of blank text and then save your post. The server will automatically delete it.
ID: 105534 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 189 · 190 · 191 · 192 · 193 · 194 · 195 . . . 300 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org