I made a change to my deployment manifest that modified a network configuration. The director then drove the CPI with a delete_vm
request (which was successful) and followed that with a create_vm
request that failed:
D, [2016-06-22 19:30:59 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] DEBUG -- DirectorJobRunner: Networks have changed. Recreating VM
D, [2016-06-22 19:30:59 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] DEBUG -- DirectorJobRunner: Failed to update in place. Recreating VM
I, [2016-06-22 19:30:59 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] INFO -- DirectorJobRunner: Deleting VM
D, [2016-06-22 19:30:59 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] DEBUG -- DirectorJobRunner: External CPI sending request: {"method":"delete_vm","arguments":["21700701"],"context":{"director_uuid":"9489a253-16f8-4dd4-b2d0-d7ac38a646b7"}} with command: /var/vcap/jobs/softlayer_cpi/bin/cpi
...
D, [2016-06-22 19:32:38 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] DEBUG -- DirectorJobRunner: External CPI got response: {"result":null,"error":null,"log":""}, err: , exit_status: pid 29017 exit 0
...
I, [2016-06-22 19:32:38 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] INFO -- DirectorJobRunner: Creating VM
D, [2016-06-22 19:32:38 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] DEBUG -- DirectorJobRunner: External CPI sending request: {"method":"create_vm","arguments":["fa87f505-5c60-409e-bbc8-39443b178a90","1163771",{"Bosh_ip":"10.155.248.130","Datacenter":{"Name":"dal09"},"Domain":"softlayer.com","EphemeralDiskSize":20,"HourlyBillingFlag":true,"MaxMemory":4096,"NetworkComponents":[{"MaxSpeed":1000}],"PrimaryBackendNetworkComponent":{"NetworkVlan":{"Id":1113235}},"PrimaryNetworkComponent":{"NetworkVlan":{"Id":1113225}},"PrivateNetworkOnlyFlag":false,"StartCpus":4,"VmNamePrefix":"cf-furnace-pub"},{"cf1":{"ip":"10.155.198.10","netmask":"255.255.255.192","cloud_properties":{},"default":["dns"],"dns":["10.0.80.11","10.0.80.12"],"gateway":"10.155.198.1"},"cf1-dynamic":{"type":"dynamic","cloud_properties":{},"dns":["10.0.80.11","10.0.80.12"],"ip":"10.155.171.2","netmask":"255.255.255.192","gateway":"10.155.198.1"},"public":{"ip":"169.45.188.210","netmask":"255.255.255.240","cloud_properties":{},"default":["gateway"],"gateway":"169.45.188.209"}},[],{"bosh":{"password":null}}],"context":{"director_uuid":"9489a253-16f8-4dd4-b2d0-d7ac38a646b7"}} with command: /var/vcap/jobs/softlayer_cpi/bin/cpi
D, [2016-06-22 19:32:38 #28586] [canary_update(ha_proxy_z1/0 (1280b3d6-09ef-4c32-b4cc-7c7828b986be))] DEBUG -- DirectorJobRunner: External CPI got response: {"result":null,"error":{"type":"Bosh::Clouds::CloudError","message":"OS Reloading VM with agent ID 'fa87f505-5c60-409e-bbc8-39443b178a90': Could not find virtual guest by ip address: 10.155.171.2: softlayer-go: could not SoftLayer_Account#getVirtualGuests, HTTP error code: '500'","ok_to_retry":false},"log":""}, err: , exit_status: pid 29031 exit 0
The root cause appears (again) to be related to the use of the IP address to find the VM for a reload.