Message boards : Number crunching : New jobs in Rosetta Python projects
Author | Message |
---|---|
dubadmin Volunteer moderator Project developer Project scientist Send message Joined: 17 Jan 24 Posts: 1 Credit: 0 RAC: 0 |
We will be releasing new jobs using the Rosetta Python projects application with RoseTTAFold All-Atom, our state-of-the-art deep-learning-based application that predicts the structures of proteins in complex with ligands such as DNA and other biological molecules, and also models post-translational modifications. As a reminder, this application will be released in the form of a virtual machine. To help us with testing and gathering performance statistics, we ask that you do not abort related work units. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1677 Credit: 17,749,522 RAC: 22,954 |
Will they make use of Multiattach mode disk images? The biggest problem (of many) with Python tasks making use of VMs is that each & every one requires massive amounts of disk space & the disk I/O needed when starting each new Task. So the disk I/O and space required for 32core/64thread and larger systems is more than just somewhat ridiculous. Grant Darwin NT |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
"Will they make use of Multiattach mode disk images?" In Ralph, no. They are still using the old VirtualBox wrapper (26203) without multiattach mode. In December the BOINC team released a new version (26207)... |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
I see there are over 500 WUs of the Python app, with the old version (June 2021). |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
"In December the BOINC team released a new version (26207)..." Here you can find the latest vbox wrapper and some guides. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
"In December the BOINC team released a new version (26207)..." I wrote to a BOINC developer and he told me that the latest stable version is 26206 (26207 is in beta). This is why it is not here, in the release notes. |
kksplace Send message Joined: 12 May 19 Posts: 7 Credit: 5,303,601 RAC: 0 |
You asked not to abort these tasks. I received four today. They all show "Postponed: Communication with VM Hypervisor failed." after about 12 seconds. Should I just leave them or abort them? |
Ryan Munro Send message Joined: 19 Nov 15 Posts: 16 Credit: 13,977,143 RAC: 5,633 |
I am getting the error "Postponed: Communication with VM Hypervisor failed" as well, on Linux Mint 21.3 running the latest version of VirtualBox. Does anyone know of a fix? |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
No problems here with my Windows 11 machines. A little note: on my 12-core CPU I manually started 2-3 WUs at the same time (not twelve), because of disk and RAM usage. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
A strange behaviour: every time I finish all my WUs and download new ones, the .vdi file (5 GB) is downloaded again as well. |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,152,433 RAC: 4,296 |
"We will be releasing new jobs using the Rosetta Python projects application with RoseTTAFold All-Atom, our state-of-the-art deep-learning-based application that predicts the structures of proteins in complex with ligands such as DNA and other biological molecules, and also models post-translational modifications. As a reminder, this application will be released in the form of a virtual machine. To help us with testing and gathering performance statistics, we ask that you do not abort related work units." I know Rosetta hasn't ever done this, but these new Python tasks take 7.45 GB of memory for EACH task on my Windows PCs. I have PCs with anything from 6 to 32 CPU cores, but not one of them has enough memory to fill every CPU core with a Python task without locking up to the point where the only fix is a reboot and a reduction in the number of tasks I can run at Rosetta. Normally it's not a problem with anything EXCEPT the Python tasks, so what I'm asking is that you add an optional setting to run the Python tasks, be it through a choice box or through a box that says e.g. 'allow beta tasks' that we can then select. The problem with the second option alone is that people can't choose not to run the regular Rosetta tasks while they wait for Python tasks to come their way, so they could easily end up with the same lack-of-memory problem I'm having on my desktops. A simple set of task-type choice boxes would make things a whole lot easier on the user's end and would not really change anything beyond the initial setup for you Admins on your end. 
This page shows what I am talking about, but most projects have something similar: https://escatter11.fullerton.edu/nfs/prefs.php?subset=project In case the link doesn't work, it's my Preferences page at NFS. Here is a copy of one section of it: Home; Resource share: 1; Use CPU; Run only the selected applications: 14e Lattice Sieve (lasieved) - uses less than 0.5 GB memory, work may be infrequently available: yes; 15e Lattice Sieve for smaller numbers (lasievee_small) - small numbers, uses up to 0.8 GB memory: no; 15e Lattice Sieve (lasievee) - medium numbers, uses up to 1 GB memory: no; 16e Lattice Sieve for smaller numbers (lasievef_small) - large numbers, uses up to 1 GB memory: no; 16e Lattice Sieve V5 (lasieve5f) - largest numbers, uses up to 1.25 GB memory: no; If no work for selected applications is available, accept work from other applications? no. Thank you for reading this, and I hope you can discuss and implement this soon. Thank you for your time, mikey |
Ryan Munro Send message Joined: 19 Nov 15 Posts: 16 Credit: 13,977,143 RAC: 5,633 |
Anyone managed to get the VM jobs working on Linux? |
Link Send message Joined: 4 May 07 Posts: 356 Credit: 382,349 RAC: 0 |
"I know Rosetta hasn't ever done this, but these new Python tasks take 7.45 GB of memory for EACH task on my Windows PCs. I have PCs with anything from 6 to 32 CPU cores, but not one of them has enough memory to fill every CPU core with a Python task without locking up to the point where the only fix is a reboot and a reduction in the number of tasks I can run at Rosetta. Normally it's not a problem with anything EXCEPT the Python tasks" Then limit the number of concurrent Python tasks using an app_config.xml. The limit can even be different for each of your computers, depending on the amount of memory. |
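
A minimal sketch of the app_config.xml approach, assuming the 7.45 GB per task reported earlier in this thread. The application name `rosetta_python_projects`, the 4 GB reserve for the operating system, and both helper names are assumptions for illustration; check the real short application name in your own client_state.xml. The sketch estimates how many Python tasks fit in a host's RAM and prints a matching file using BOINC's `<max_concurrent>` element:

```python
GB_PER_TASK = 7.45  # per-task memory reported in this thread (Windows hosts)


def max_python_tasks(total_ram_gb, reserve_gb=4.0, gb_per_task=GB_PER_TASK):
    """How many VM tasks fit in RAM after leaving reserve_gb for the OS.

    reserve_gb is an assumed safety margin, not a project recommendation.
    """
    usable = max(total_ram_gb - reserve_gb, 0.0)
    return int(usable // gb_per_task)


def render_app_config(app_name, max_concurrent):
    """Return app_config.xml text limiting one application's concurrency."""
    if max_concurrent < 1:
        # A host that cannot fit even one task needs a different fix.
        raise ValueError("max_concurrent must be at least 1")
    return (
        "<app_config>\n"
        "  <app>\n"
        f"    <name>{app_name}</name>\n"
        f"    <max_concurrent>{max_concurrent}</max_concurrent>\n"
        "  </app>\n"
        "</app_config>\n"
    )


if __name__ == "__main__":
    limit = max_python_tasks(32)  # a 32 GB host
    # "rosetta_python_projects" is an assumed name; verify it locally.
    print(render_app_config("rosetta_python_projects", limit))
```

With these assumptions a 32 GB host gets a limit of 3 and a 16 GB host a limit of 1. The file belongs in the Rosetta project folder inside the BOINC data directory, and the client re-reads it after a restart or after using the Manager's "Read config files" menu item.
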
highCon Send message Joined: 25 Aug 19 Posts: 1 Credit: 8,163,882 RAC: 12,208 |
"Anyone managed to get the VM jobs working on Linux?" Yes, it works (Arch Linux with VirtualBox installed). Do you encounter any problems? I had these: 1. Because the tasks are so disk-hungry, I had to move my BOINC folder to another partition. 2. After uploading some successfully crunched tasks, new tasks were suspended with the status "Communication with VM Hypervisor failed" (NS_ERROR_SOCKET_FAIL in the vbox logs). After some digging, cleaning the <HardDisk> entries in VirtualBox.xml helped. 3. Just today, some tasks were running fine, while others had the status "Postponed: VM environment needed to be cleaned up". Perhaps this was caused by the previous intervention (point 2); luckily, restarting boinc-client helped.
|
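
Before editing VirtualBox.xml by hand as in point 2, it may help to see which `<HardDisk>` entries are registered. The sketch below only reads the file's text and lists them; it changes nothing. The function name and the sample XML (UUIDs, paths) are made up for illustration, and the post does not say which entries were removed, so deciding what counts as stale is left to the reader. Stop boinc-client and back up the file before changing anything.

```python
import xml.etree.ElementTree as ET


def list_hard_disks(xml_text):
    """Return (uuid, location) for every <HardDisk> element, at any depth."""
    root = ET.fromstring(xml_text)
    disks = []
    for elem in root.iter():
        # Drop a namespace prefix such as "{http://www.virtualbox.org/}".
        tag = elem.tag.rsplit("}", 1)[-1]
        if tag == "HardDisk":
            disks.append((elem.get("uuid"), elem.get("location")))
    return disks


# Made-up sample in the general shape of a VirtualBox media registry.
SAMPLE = """<VirtualBox xmlns="http://www.virtualbox.org/">
  <Global>
    <MediaRegistry>
      <HardDisks>
        <HardDisk uuid="{1111}" location="/boinc/slots/0/vm_image.vdi" format="VDI"/>
        <HardDisk uuid="{2222}" location="/boinc/slots/1/vm_image.vdi" format="VDI"/>
      </HardDisks>
    </MediaRegistry>
  </Global>
</VirtualBox>"""

if __name__ == "__main__":
    for uuid, location in list_hard_disks(SAMPLE):
        print(uuid, location)
```

To run it against a real file, pass the file's contents (for example `open(path).read()`) instead of `SAMPLE`; the path of VirtualBox.xml depends on how BOINC and VirtualBox are installed.
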
Ryan Munro Send message Joined: 19 Nov 15 Posts: 16 Credit: 13,977,143 RAC: 5,633 |
I get "Postponed: Communication with VM Hypervisor failed" when a unit is downloaded |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
"Thank you for reading this and I hope you can discuss and implement this soon." Do you know that they do not read the forums and that, even if they do read them, they do NOTHING to improve users' experience of the project? |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,552,383 RAC: 6,167 |
"But it definitely hurts me to see that the 5.5 GB compressed vdi file is re-downloaded every time new tasks start (when no Python-based tasks are currently crunching, which apparently keeps the downloaded file alive)." It seems that the project removes the vdi file from the project folder... and then starts the download again (still with the old wrapper). |
hadron Send message Joined: 4 Sep 22 Posts: 68 Credit: 1,557,166 RAC: 197 |
"Anyone managed to get the VM jobs working on Linux?" Is user boinc a member of group vboxusers? Run "sudo groups boinc" to find out. Then, if necessary, use "sudo groupmod -a -U boinc vboxusers" to add boinc to that group. |
Ryan Munro Send message Joined: 19 Nov 15 Posts: 16 Credit: 13,977,143 RAC: 5,633 |
The output I get is "boinc : boinc video render vboxusers", so I am assuming so? |
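
The group check can also be done mechanically. This is a small sketch, with hypothetical helper names, that parses the output of `groups <user>` and tests for vboxusers. Fed the output quoted above, it confirms that boinc is already in the group, so missing membership does not look like the cause here. Where groupmod does not support `-U`, the widely available equivalent is "sudo usermod -aG vboxusers boinc"; either way, boinc-client has to be restarted before a new group membership takes effect.

```python
import subprocess


def parse_groups(groups_output):
    """Return the group names from `groups <user>` output.

    Accepts 'boinc : boinc video render vboxusers' as well as the
    bare form 'boinc video render vboxusers'.
    """
    head, sep, tail = groups_output.partition(":")
    return (tail if sep else head).split()


def is_member(groups_output, group="vboxusers"):
    """True if the exact group name appears in the parsed output."""
    return group in parse_groups(groups_output)


def live_check(user="boinc", group="vboxusers"):
    """Run `groups <user>` on this machine (Linux only; not called below)."""
    out = subprocess.run(
        ["groups", user], capture_output=True, text=True, check=True
    ).stdout
    return is_member(out, group)


if __name__ == "__main__":
    # Output quoted in the post above.
    print(is_member("boinc : boinc video render vboxusers"))  # prints True
```

The match is on the exact name, so the singular "vboxuser" would not count as membership.
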
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,152,433 RAC: 4,296 |
"I know Rosetta hasn't ever done this, but these new Python tasks take 7.45 GB of memory for EACH task on my Windows PCs. I have PCs with anything from 6 to 32 CPU cores, but not one of them has enough memory to fill every CPU core with a Python task without locking up to the point where the only fix is a reboot and a reduction in the number of tasks I can run at Rosetta. Normally it's not a problem with anything EXCEPT the Python tasks" I DO, and I said that, but the problem, as I said, is that when I get the normal or beta tasks I will want to run as many at a time as I can, and I can't do that with the current Preferences setup. I know I could use an app_config.xml file to limit this or that type of task, but I have forgotten how to do that; the settings are all project-specific and I don't remember how to do that here. Besides, newbies will have no clue and will just give up and go elsewhere, and that's NOT what anyone wants. PrimeGrid has an elaborate preferences setup that even includes the number of CPUs per task, in addition to the type of task and how many of that type to run, so it IS possible to have more and better choices. Honestly, I wish the Developers would just hard-code it into the Server code, which would fix places like Gerasim, who have totally borked the settings (but that's a whole other problem), as well as help out places like Rosetta. YES, Projects should be able to disable the defaults as they do now, but sameness can be an advantage for some things. |
©2024 University of Washington
https://www.bakerlab.org