
Restbase migration to Buster
Closed, ResolvedPublic

Description

Much of our RESTBase infrastructure is still running on stretch. We need to migrate to buster as soon as possible, ideally while creating a process we can reuse for the coming upgrades to bullseye.

Initial work:

  • Investigate whether we can reuse data-persistence work to migrate restbase nodes without wiping their disks

  • If not, automate (via a cookbook? possibly overkill) the decommissioning of RESTBase nodes while we reimage them

  • Ensure that our reimaging process allows reimaged nodes to rejoin the cluster without issue (a decommission will stop other nodes in the cluster from talking to this node)

Process:
Reimage: sudo cookbook sre.hosts.reimage --os buster -t T295375 $HOSTNAME -c
Fix permissions after reimage (check by hand; some GIDs vary between hosts): sudo find /srv/ -user envoy -exec chown cassandra:cassandra {} \;
Workarounds for T300177:

  • sudo -u deploy-service /usr/bin/scap deploy-local --repo cassandra/twcs
  • sudo -u deploy-service /usr/bin/scap deploy-local --repo restbase/deploy
  • sudo -u deploy-service /usr/bin/scap deploy-local --repo cassandra/logstash-logback-encoder

Enable cassandra: sudo touch /etc/cassandra-{a,b,c}/service-enabled && for i in a b c; do sudo service cassandra-${i} start; done
Once the host has rejoined the clusters (check compactions, and watch for instance-data checks failing on the cluster), on the puppetmaster: sudo confctl select name=$HOSTNAME set/pooled=yes
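
The process above repeats identically for every host in the migration list, so it can help to print the whole checklist with the hostname filled in. A minimal sketch (the rb_runbook helper is our own invention, not an existing tool; the commands it emits are copied verbatim from the process above):

```shell
# Sketch only: print the per-host runbook for a given RESTBase host.
# Nothing is executed; the function just fills the hostname into the
# commands from the task description.
rb_runbook() {
    host="${1:?usage: rb_runbook HOSTNAME}"
    cat <<EOF
# 1. Reimage (from a cumin host):
sudo cookbook sre.hosts.reimage --os buster -t T295375 ${host} -c
# 2. Fix permissions (check by hand; GIDs vary between hosts):
sudo find /srv/ -user envoy -exec chown cassandra:cassandra {} \;
# 3. Workarounds for T300177:
sudo -u deploy-service /usr/bin/scap deploy-local --repo cassandra/twcs
sudo -u deploy-service /usr/bin/scap deploy-local --repo restbase/deploy
sudo -u deploy-service /usr/bin/scap deploy-local --repo cassandra/logstash-logback-encoder
# 4. Enable and start the Cassandra instances:
sudo touch /etc/cassandra-{a,b,c}/service-enabled
for i in a b c; do sudo service cassandra-\${i} start; done
# 5. Once the host has rejoined the clusters, on the puppetmaster:
sudo confctl select name=${host} set/pooled=yes
EOF
}
```

Running `rb_runbook restbase1028.eqiad.wmnet` prints the checklist for that host; the operator still runs each step by hand and verifies cluster state between steps 4 and 5.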

Host migration:
Hosts marked with * are affected by T299652 and require BIOS upgrades to be reimaged

  • restbase1016.eqiad.wmnet
  • restbase1017.eqiad.wmnet
  • restbase1018.eqiad.wmnet
  • restbase1019.eqiad.wmnet*
  • restbase1020.eqiad.wmnet*
  • restbase1021.eqiad.wmnet*
  • restbase1022.eqiad.wmnet*
  • restbase1023.eqiad.wmnet*
  • restbase1024.eqiad.wmnet*
  • restbase1025.eqiad.wmnet*
  • restbase1026.eqiad.wmnet*
  • restbase1027.eqiad.wmnet*
  • restbase1028.eqiad.wmnet
  • restbase1029.eqiad.wmnet
  • restbase1030.eqiad.wmnet

  • restbase2009.codfw.wmnet

  • restbase2010.codfw.wmnet
  • restbase2011.codfw.wmnet
  • restbase2012.codfw.wmnet
  • restbase2013.codfw.wmnet
  • restbase2014.codfw.wmnet
  • restbase2015.codfw.wmnet
  • restbase2016.codfw.wmnet
  • restbase2017.codfw.wmnet*
  • restbase2018.codfw.wmnet
  • restbase2019.codfw.wmnet*
  • restbase2020.codfw.wmnet*
  • restbase2021.codfw.wmnet
  • restbase2022.codfw.wmnet
  • restbase2023.codfw.wmnet
  • restbase2024.codfw.wmnet
  • restbase2025.codfw.wmnet
  • restbase2026.codfw.wmnet

To be replaced by new instances

  • restbase-dev1004.eqiad.wmnet
  • restbase-dev1005.eqiad.wmnet
  • restbase-dev1006.eqiad.wmnet
  • deployment-restbase03.deployment-prep.eqiad1.wikimedia.cloud
  • deployment-restbase04

Event Timeline


Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1019.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase2026.codfw.wmnet with OS buster completed:

  • restbase2026 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase2026.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-backend"}
{"restbase2026.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-ssl"}
{"restbase2026.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201211435_hnowlan_25630_restbase2026.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1019.eqiad.wmnet with OS buster executed with errors:

  • restbase1019 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1019.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1019.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1019.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1020.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1020.eqiad.wmnet with OS buster executed with errors:

  • restbase1020 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1020.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1020.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1020.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1021.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1021.eqiad.wmnet with OS buster executed with errors:

  • restbase1021 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1021.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1021.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1021.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1022.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1022.eqiad.wmnet with OS buster executed with errors:

  • restbase1022 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1022.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1022.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1022.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1023.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1023.eqiad.wmnet with OS buster executed with errors:

  • restbase1023 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1023.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1023.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1023.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1024.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1024.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1024.eqiad.wmnet with OS buster executed with errors:

  • restbase1024 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1024.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1024.eqiad.wmnet with OS buster executed with errors:

  • restbase1024 (FAIL)
    • Removed from Puppet and PuppetDB if present
    • Deleted any existing Puppet certificate
    • Removed from Debmonitor if present
    • Forced PXE for next reboot
    • Host rebooted via IPMI
    • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1025.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1025.eqiad.wmnet with OS buster executed with errors:

  • restbase1025 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1025.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1025.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1025.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1026.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1026.eqiad.wmnet with OS buster executed with errors:

  • restbase1026 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1026.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1026.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1026.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1027.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1027.eqiad.wmnet with OS buster executed with errors:

  • restbase1027 (FAIL)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1027.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1027.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1027.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes

  • The reimage failed, see the cookbook logs for the details

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1028.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1028.eqiad.wmnet with OS buster completed:

  • restbase1028 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1028.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1028.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1028.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201241240_hnowlan_18719_restbase1028.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1029.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1029.eqiad.wmnet with OS buster completed:

  • restbase1029 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1029.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1029.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1029.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201241326_hnowlan_1536_restbase1029.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1030.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1030.eqiad.wmnet with OS buster completed:

  • restbase1030 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1030.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1030.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1030.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201241401_hnowlan_1900_restbase1030.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1019.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1019.eqiad.wmnet with OS buster completed:

  • restbase1019 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1019.eqiad.wmnet": {"weight": 10, "pooled": "no"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1019.eqiad.wmnet": {"weight": 10, "pooled": "no"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1019.eqiad.wmnet": {"weight": 10, "pooled": "no"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201261721_hnowlan_16490_restbase1019.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=no
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=no
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=no

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1020.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1020.eqiad.wmnet with OS buster completed:

  • restbase1020 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1020.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1020.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1020.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201281227_hnowlan_30102_restbase1020.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes

  • Updated Netbox data from PuppetDB
hnowlan updated the task description.

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1021.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1021.eqiad.wmnet with OS buster completed:

  • restbase1021 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1021.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1021.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1021.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201281450_hnowlan_28655_restbase1021.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1022.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1022.eqiad.wmnet with OS buster completed:

  • restbase1022 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1022.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1022.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1022.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201281547_hnowlan_5950_restbase1022.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1023.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1023.eqiad.wmnet with OS buster completed:

  • restbase1023 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1023.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1023.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1023.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201281641_hnowlan_16165_restbase1023.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1024.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1024.eqiad.wmnet with OS buster completed:

  • restbase1024 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1024.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1024.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1024.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201281717_hnowlan_22550_restbase1024.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1025.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1025.eqiad.wmnet with OS buster completed:

  • restbase1025 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1025.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1025.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}
{"restbase1025.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201311120_hnowlan_26850_restbase1025.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1026.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1026.eqiad.wmnet with OS buster completed:

  • restbase1026 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1026.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1026.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1026.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201311202_hnowlan_15015_restbase1026.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase1027.eqiad.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase1027.eqiad.wmnet with OS buster completed:

  • restbase1027 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase1027.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-ssl"}
{"restbase1027.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase"}
{"restbase1027.eqiad.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=eqiad,cluster=restbase,service=restbase-backend"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202201311246_hnowlan_26234_restbase1027.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is not optimal, downtime not removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-ssl' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=eqiad,cluster=restbase,service=restbase-backend' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase2017.codfw.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase2017.codfw.wmnet with OS buster completed:

  • restbase2017 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase2017.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase"}
{"restbase2017.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-backend"}
{"restbase2017.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202202011717_hnowlan_25220_restbase2017.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=codfw,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase2019.codfw.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage was started by hnowlan@cumin1001 for host restbase2020.codfw.wmnet with OS buster

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase2020.codfw.wmnet with OS buster completed:

  • restbase2020 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase2020.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase"}
{"restbase2020.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-backend"}
{"restbase2020.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202202081106_hnowlan_28815_restbase2020.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=codfw,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB

Cookbook cookbooks.sre.hosts.reimage started by hnowlan@cumin1001 for host restbase2019.codfw.wmnet with OS buster completed:

  • restbase2019 (WARN)
    • Downtimed on Icinga
    • Set pooled=inactive for the following services on confctl:

{"restbase2019.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase"}
{"restbase2019.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-backend"}
{"restbase2019.codfw.wmnet": {"weight": 10, "pooled": "yes"}, "tags": "dc=codfw,cluster=restbase,service=restbase-ssl"}

  • Disabled Puppet
  • Removed from Puppet and PuppetDB if present
  • Deleted any existing Puppet certificate
  • Removed from Debmonitor if present
  • Forced PXE for next reboot
  • Host rebooted via IPMI
  • Host up (Debian installer)
  • Host up (new fresh buster OS)
  • Generated Puppet certificate
  • Signed new Puppet certificate
  • Run Puppet in NOOP mode to populate exported resources in PuppetDB
  • Found Nagios_host resource for this host in PuppetDB
  • Downtimed the new host on Icinga
  • First Puppet run completed and logged in /var/log/spicerack/sre/hosts/reimage/202202081058_hnowlan_17784_restbase2019.out
  • Checked BIOS boot parameters are back to normal
  • Rebooted
  • Automatic Puppet run was successful
  • Forced a re-check of all Icinga services for the host
  • Icinga status is optimal
  • Icinga downtime removed
  • Services in confctl are not automatically pooled, to restore the previous state you have to run the following commands:

sudo confctl select 'dc=codfw,cluster=restbase,service=restbase' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-backend' set/pooled=yes
sudo confctl select 'dc=codfw,cluster=restbase,service=restbase-ssl' set/pooled=yes

  • Updated Netbox data from PuppetDB
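Each "Set pooled=inactive" section above records the service's previous confctl state as one JSON object per line. A small script can parse these records to confirm what a host should be restored to after reimaging. This is a sketch assuming the line format exactly as printed in the log (one host key plus a comma-separated `tags` string); the sample line is copied from the restbase2019 entry.

```python
import json

# One state line as printed by the reimage cookbook (confctl JSON output).
line = ('{"restbase2019.codfw.wmnet": {"weight": 10, "pooled": "yes"}, '
        '"tags": "dc=codfw,cluster=restbase,service=restbase"}')

record = json.loads(line)
# "tags" is a comma-separated key=value string, e.g. "dc=codfw,cluster=...".
tags = dict(kv.split("=") for kv in record.pop("tags").split(","))
# After popping "tags", the only remaining key is the host FQDN.
(host, state), = record.items()

print(host, tags["service"], state["pooled"])
```

Running this over all such lines for a host yields the exact service list and prior pooled state to restore with `confctl`.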

Change 761006 merged by Hnowlan:

[operations/puppet@production] restbase: remove restbase2010

https://gerrit.wikimedia.org/r/761006

Change 764801 had a related patch set uploaded (by Hnowlan; author: Hnowlan):

[operations/puppet@production] restbase: add deployment-restbase04

https://gerrit.wikimedia.org/r/764801

Change 764825 had a related patch set uploaded (by Hnowlan; author: Hnowlan):

[mediawiki/services/restbase/deploy@master] Add deployment-restbase04

https://gerrit.wikimedia.org/r/764825

Change 764825 merged by Hnowlan:

[mediawiki/services/restbase/deploy@master] Add deployment-restbase04

https://gerrit.wikimedia.org/r/764825

Change 765313 had a related patch set uploaded (by Hnowlan; author: Hnowlan):

[operations/puppet@production] restbase: disable redundant jmx config

https://gerrit.wikimedia.org/r/765313

Change 764801 merged by Hnowlan:

[operations/puppet@production] restbase: add deployment-restbase04

https://gerrit.wikimedia.org/r/764801

Change 765313 merged by Hnowlan:

[operations/puppet@production] restbase: disable redundant jmx config

https://gerrit.wikimedia.org/r/765313

Change 765532 had a related patch set uploaded (by Hnowlan; author: Hnowlan):

[operations/puppet@production] restbase: change endpoint for deployment-prep to new host

https://gerrit.wikimedia.org/r/765532

Change 766082 had a related patch set uploaded (by Hnowlan; author: Hnowlan):

[operations/puppet@production] restbase-dev: change role of new hosts

https://gerrit.wikimedia.org/r/766082

Change 765532 merged by Hnowlan:

[operations/puppet@production] restbase: change endpoint for deployment-prep to new host

https://gerrit.wikimedia.org/r/765532

Change 766602 had a related patch set uploaded (by Hnowlan; author: Hnowlan):

[operations/mediawiki-config@master] Move to buster restbase host

https://gerrit.wikimedia.org/r/766602

Change 766602 merged by jenkins-bot:

[operations/mediawiki-config@master] [Beta Cluster] LabsServices: Move to buster restbase host

https://gerrit.wikimedia.org/r/766602

Mentioned in SAL (#wikimedia-releng) [2022-05-09T21:43:05Z] <James_F> Beta Cluster: Shutting down old deployment-restbase03 instance for T295375

hnowlan updated the task description.

The Data Persistence team will handle the reimaging of the restbase-dev cluster on the new codfw hardware.

hnowlan claimed this task.

Change 766082 abandoned by Hnowlan:

[operations/puppet@production] restbase-dev: create new codfw cluster, replace old eqiad cluster

Reason:

These hosts aren't used for restbase-dev any more

https://gerrit.wikimedia.org/r/766082