Hi,
We are having some issues where a couple scheduled services now seem to want to do the same job across the two servers running on our small cluster.
This popped up after a cluster restart yesterday. It is as if the scheduler is tasking both servers to execute the same service at the same time, in effect doing the same job twice and causing a bit of a mess.
Here is how it used to look in an older log file:
2017-10-20 15:37:25,861 - ^[[1;37mINFO^[[0m - 10502:Dummy-6876 - dist-core-pdf-production.poll-result-folder:22 - ----------- START ----------
2017-10-20 15:37:25,861 - ^[[1;37mINFO^[[0m - 10502:Dummy-6876 - dist-core-pdf-production.poll-result-folder:22 - ----------- END ----------
And that was that until next schedule cycle.
Here is after cluster restart:
2017-11-05 00:40:00,012 - ^[[1;37mINFO^[[0m - 24086:Dummy-3168 - dist-core-pdf-production.poll-result-folder:22 - ----------- START ----------
2017-11-05 00:40:00,014 - ^[[1;37mINFO^[[0m - 24086:Dummy-3168 - dist-core-pdf-production.poll-result-folder:22 - ----------- END ----------
2017-11-05 00:40:00,014 - ^[[1;37mINFO^[[0m - 24087:Dummy-1555 - dist-core-pdf-production.poll-result-folder:22 - ----------- START ----------
2017-11-05 00:40:00,015 - ^[[1;37mINFO^[[0m - 24087:Dummy-1555 - dist-core-pdf-production.poll-result-folder:22 - ----------- END ----------
And here is after cluster restart also, earlier than the example above. Here the two instances of this service are stealing files from each other, causing panic:
2017-11-04 23:43:00,007 - ^[[1;37mINFO^[[0m - 24108:Dummy-958 - dist-core-pdf-production.poll-result-folder:22 - ----------- START ----------
2017-11-04 23:43:00,008 - ^[[1;37mINFO^[[0m - 24107:Dummy-1908 - dist-core-pdf-production.poll-result-folder:22 - 1 pdf result file(s) up for import. Sending to result->
2017-11-04 23:43:00,009 - ^[[1;37mINFO^[[0m - 24108:Dummy-958 - dist-core-pdf-production.poll-result-folder:22 - 1 pdf result file(s) up for import. Sending to result->
2017-11-04 23:43:00,561 - ^[[1;37mINFO^[[0m - 24107:Dummy-1908 - dist-core-pdf-production.poll-result-folder:22 - [Errno 2] No such file or directory: u’/fileexchange/dist/pdf/Pdf_bundle_Queue_6_2017-11-04_F061_.zip’
2017-11-04 23:43:00,563 - ^[[1;37mINFO^[[0m - 24107:Dummy-1908 - dist-core-pdf-production.poll-result-folder:22 - Error unzipping pdf file bundle
…
We are running Zato 2.0.7 at the moment, I’ve read the posts about this regarding versions < 2.0.4 and that it was fixed.
Anyone got a fix for this apart from cluster restart and prayers?
Thanks