I have two boincs clients on the same computer
log in |
Message boards : Number crunching : I have two boincs clients on the same computer
Author | Message |
---|---|
"C:\Program Files\BOINC\boinc.exe" --allow_multiple_clients -dir c:\programdata\boinc_2 -gui_rpc_port 9999" | |
ID: 4954 · Rating: 0 · rate: / Reply Quote | |
See this thread Multi Clients, it may be of some help. | |
ID: 4955 · Rating: 0 · rate: / Reply Quote | |
"C:\Program Files\BOINC\boinc.exe" --allow_multiple_clients -dir c:\programdata\boinc_2 -gui_rpc_port 9999" c:\programdata\boinc_2\projects\wuprop.boinc-af.org\app_config.xml: <app_config> <app_version> <app_name>data_collect_v4</app_name> <plan_class>nci</plan_class> <avg_ncpus>0.01</avg_ncpus> <cmdline>-p 9999</cmdline> </app_version> </app_config> | |
ID: 4956 · Rating: 0 · rate: / Reply Quote | |
Yes ! I added app_config.xml in | |
ID: 4957 · Rating: 0 · rate: / Reply Quote | |
I'm trying this again but it's not working this time. | |
ID: 5860 · Rating: 0 · rate: / Reply Quote | |
And: The Project Administrator is Blocking the Second BOINC Client so you don't get to many Hr's ... | |
ID: 5861 · Rating: 0 · rate: / Reply Quote | |
The client is split because that particular project doesn't make save points for 12 to 18 hours and I lose 1/2 to 3/4 a day of work shutting down BOINC to deal with the other projects' technical difficulties or to change my app_config files. | |
ID: 5862 · Rating: 0 · rate: / Reply Quote | |
(accidental double post) | |
ID: 5863 · Rating: 0 · rate: / Reply Quote | |
Which project is this? | |
ID: 5864 · Rating: 0 · rate: / Reply Quote | |
Which project is this? A save point is where the WU saves it's progress while performing the calculations. The CPU's are used 99.99% of the time. If you interrupt the WU before it makes a save point then 4, 8 or up to 18 hours of 16 or 32 cores of CPU cycles are just thrown into the void. The split into two clients is very necessary to avoid lost work. | |
ID: 5865 · Rating: 0 · rate: / Reply Quote | |
Which project is this? Oh checkpoints, I read it wrong. Not sure a 2nd client is needed for that. | |
ID: 5867 · Rating: 0 · rate: / Reply Quote | |
Which project is this? I'm sure. Unless you know of a way to shutdown boinc.exe that forces all WU to create checkpoints even though the project doesn't support them. I tried running the project in a VM and it's RAC is 1/4th that of raw so separate client is the best solution. The project remains in RAM and calculating while maintenance is performed on the other projects and 96 to 192 CPU hours are not wasted. | |
ID: 5868 · Rating: 0 · rate: / Reply Quote | |
The client is split because that particular project doesn't make save points for 12 to 18 hours and I lose 1/2 to 3/4 a day of work shutting down BOINC to deal with the other projects' technical difficulties or to change my app_config files. Instead of restarting BOINC you can have it reread the config files to pickup any changes. Do you intend to put the project with long checkpoints in its own instance of BOINC and then have your other projects on the host in another instance? That is easily doable. You just have to make sure you have properly limited the number of processors on each instance so the total number of running tasks does not exceed the number of actual processors. A host with 16 processors can not run more than 16 CPU tasks at once across any number of instances. Otherwise a WUprop server rule will ignore one, or all, of the clients running on that host. | |
ID: 5869 · Rating: 0 · rate: / Reply Quote | |
the number of processors on each instance so the total number of running tasks does not exceed the number of actual processors. Is WUProps sophisticated enough to count the number of cores used by the multicore Virtual Boxes? Instead of restarting BOINC you can have it reread the config files to pickup any changes. That's nice to know (I'm forgetting things. I used to know that.) but it doesn't clear up the "Environment needs to be cleaned up" and 'Postponed" error messages of the Virtual Box WU's where I have to suspend all work and kill the vboxservice to clean up the errors. Known defect in LHC WU's that they won't address. Keeping the client separate is still the best solution for this reason. | |
ID: 5870 · Rating: 0 · rate: / Reply Quote | |
I still don't get why you're shutting down a client. | |
ID: 5871 · Rating: 0 · rate: / Reply Quote | |
I still don't get why you're shutting down a client. but it doesn't clear up the "Environment needs to be cleaned up" and 'Postponed" error messages of the Virtual Box WU's where I have to suspend all work and kill the vboxservice to clean up the errors. I have to shut down each client and restart Virtual Box minimum every 3 days. Message discussion here: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4526 | |
ID: 5872 · Rating: 0 · rate: / Reply Quote | |
Encountered another situation today. | |
ID: 5873 · Rating: 0 · rate: / Reply Quote | |
Encountered another situation today. There is a known issue with BOINC not updating the display of resources used by tasks when values are changed in app_config until after it restarts. Even though it is using the new settings. For example changing the settings from <gpu_versions> <gpu_usage>1.0</gpu_usage> <cpu_usage>1.0</cpu_usage> </gpu_versions> to <gpu_versions> <gpu_usage>0.5</gpu_usage> <cpu_usage>0.5</cpu_usage> </gpu_versions> in order to run 2 tasks per GPU will continue to display Running (1 CPU + 1 GPU) with 2 tasks running instead of displaying Running (0.5 CPU + 0.5 GPU) That could be the same issue you were seeing. Unless there is a separate issue where app_config settings are not correctly applied to VMs after being read. | |
ID: 5874 · Rating: 0 · rate: / Reply Quote | |
A couple of the new WU that showed up in Virtual Box manager were actually 1 core instead of 2, so it was a reporting error in BoincMgr as you proffer. Although I can't be certain it's the same reporting bug, it appears to be. | |
ID: 5875 · Rating: 0 · rate: / Reply Quote | |
Message boards :
Number crunching :
I have two boincs clients on the same computer