
RobH (Rob Halsell)
Senior Data Center Engineer · Administrator

Projects (20)

User Details

User Since
Nov 24 2014, 1:43 PM (490 w, 3 d)
Roles
Administrator
Availability
Available
IRC Nick
RobH
LDAP User
RobH
MediaWiki User
RobH [ Global Accounts ]

My GPG Key fingerprint = CB1F C7E7 0FF8 5DB2 6820 9C7E 75ED 14C7 0245 D22A

I am a Senior Data Center Engineer on Wikimedia's Data Center SRE Team.

Please note that private messages via Phabricator are not my preferred means of contact. Please feel free to contact me (robh) directly via irc/freenode, or at my @wikimedia.org email address.

Recent Activity

Wed, Apr 17

RobH moved T362824: Q#:rack/setup/install dbproxy200[5-8] from Backlog to Racking Tasks on the ops-codfw board.
Wed, Apr 17, 8:26 PM · SRE, ops-codfw, Data-Persistence, DC-Ops
RobH added a project to T362824: Q#:rack/setup/install dbproxy200[5-8]: ops-codfw.
Wed, Apr 17, 8:23 PM · SRE, ops-codfw, Data-Persistence, DC-Ops
RobH added a parent task for T362824: Q#:rack/setup/install dbproxy200[5-8]: Unknown Object (Task).
Wed, Apr 17, 8:21 PM · SRE, ops-codfw, Data-Persistence, DC-Ops
RobH created T362824: Q#:rack/setup/install dbproxy200[5-8].
Wed, Apr 17, 8:20 PM · SRE, ops-codfw, Data-Persistence, DC-Ops
RobH added a comment to T362729: Q4:rack/setup/install cp70[01-16].

Thanks for the task @RobH! As in the previous runs, please feel free to leave these for Traffic:

Update the operations/puppet repo - this should include updates to preseed.yaml, and site.pp with roles defined by service group: https://wikitech.wikimedia.org/wiki/SRE/Dc-operations
Run the sre.hosts.reimage cookbook

If you use insetup::traffic, we will have to reimage to cache::text or upload anyway, so it doesn't make sense to repeat the reimages unless you think there is a particular reason. Thanks!

Wed, Apr 17, 5:01 PM · Traffic, ops-magru, DC-Ops

Tue, Apr 16

RobH moved T362730: Q4:rack/setup/install magru misc servers from Backlog to Racking Tasks on the ops-magru board.
Tue, Apr 16, 11:23 PM · Traffic, netops, ops-magru, DC-Ops, Infrastructure-Foundations
RobH added a project to T362730: Q4:rack/setup/install magru misc servers: Traffic.
Tue, Apr 16, 10:43 PM · Traffic, netops, ops-magru, DC-Ops, Infrastructure-Foundations
RobH added a parent task for T362730: Q4:rack/setup/install magru misc servers: Unknown Object (Task).
Tue, Apr 16, 10:42 PM · Traffic, netops, ops-magru, DC-Ops, Infrastructure-Foundations
RobH created T362730: Q4:rack/setup/install magru misc servers.
Tue, Apr 16, 10:42 PM · Traffic, netops, ops-magru, DC-Ops, Infrastructure-Foundations
RobH added a parent task for T362729: Q4:rack/setup/install cp70[01-16]: Unknown Object (Task).
Tue, Apr 16, 10:38 PM · Traffic, ops-magru, DC-Ops
RobH moved T362729: Q4:rack/setup/install cp70[01-16] from Backlog to Racking Tasks on the ops-magru board.
Tue, Apr 16, 10:38 PM · Traffic, ops-magru, DC-Ops
RobH created T362729: Q4:rack/setup/install cp70[01-16].
Tue, Apr 16, 10:38 PM · Traffic, ops-magru, DC-Ops

Thu, Apr 11

herron awarded T361251: titan100[12] ram/ssd upgrade coordination a Party Time token.
Thu, Apr 11, 4:48 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-eqiad

Thu, Apr 4

RobH updated the task description for T360430: esams text cp nvme upgrade.
Thu, Apr 4, 6:57 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops

Thu, Mar 28

RobH moved T361251: titan100[12] ram/ssd upgrade coordination from Backlog to Racking Tasks on the ops-eqiad board.
Thu, Mar 28, 4:11 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-eqiad
RobH added a parent task for T361251: titan100[12] ram/ssd upgrade coordination: Unknown Object (Task).
Thu, Mar 28, 4:09 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-eqiad
RobH triaged T361251: titan100[12] ram/ssd upgrade coordination as Medium priority.
Thu, Mar 28, 4:09 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-eqiad
RobH added a comment to T361229: titan200[12] RAM/SSD upgrade coordination.

@Jhancock.wm: I made a typo at the top; it should be (3) DIMMs per host, not 2. I am not changing it myself, but I am noting it in this comment so you can acknowledge and update the task description to ensure everyone's on the same page.

Thu, Mar 28, 3:39 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-codfw
RobH reassigned T361229: titan200[12] RAM/SSD upgrade coordination from Jhancock.wm to fgiunchedi.

I have located and set aside the parts to be installed.

I am available every weekday between 1300 UTC and 1700 UTC. Please let me know what time/day within that window works best.

Thu, Mar 28, 2:48 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-codfw
RobH moved T361229: titan200[12] RAM/SSD upgrade coordination from Backlog to Racking Tasks on the ops-codfw board.
Thu, Mar 28, 1:59 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-codfw
RobH added a parent task for T361229: titan200[12] RAM/SSD upgrade coordination: Unknown Object (Task).
Thu, Mar 28, 1:58 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-codfw
RobH triaged T361229: titan200[12] RAM/SSD upgrade coordination as Medium priority.
Thu, Mar 28, 1:57 PM · SRE Observability (FY2023/2024-Q4), SRE, observability, ops-codfw

Wed, Mar 27

RobH reassigned T360430: esams text cp nvme upgrade from RobH to Fabfur.

Reassigning from myself over to @Fabfur for reimaging at Traffic's leisure.

Wed, Mar 27, 3:29 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH added a comment to T360430: esams text cp nvme upgrade.

ESAMS remote hands began hands on work at 11:10 CET and it is now ongoing.

Wed, Mar 27, 10:33 AM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops

Tue, Mar 26

RobH added a comment to T360430: esams text cp nvme upgrade.

Remote work task is via CS1553796, remote hands has confirmed receipt of the SSDs and work to take place on March 27th @ 11AM CET.

Tue, Mar 26, 1:13 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH updated the task description for T360430: esams text cp nvme upgrade.
Tue, Mar 26, 1:13 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops

Mon, Mar 25

RobH closed T84585: add contract end dates to the ops maint & contract gcal as Invalid.

I no longer track contracts, those are handled via Coupa which has end date tracking. This old request is now invalid.

Mon, Mar 25, 7:31 PM · procurement, SRE

Fri, Mar 22

RobH moved T360789: codfw row C/D upgrade racking task from Backlog to Racking Tasks on the ops-codfw board.
Fri, Mar 22, 3:30 PM · SRE, Infrastructure-Foundations, netops, ops-codfw, DC-Ops
RobH added parent tasks for T360789: codfw row C/D upgrade racking task: Unknown Object (Task), Unknown Object (Task).
Fri, Mar 22, 3:30 PM · SRE, Infrastructure-Foundations, netops, ops-codfw, DC-Ops
RobH created T360789: codfw row C/D upgrade racking task.
Fri, Mar 22, 3:29 PM · SRE, Infrastructure-Foundations, netops, ops-codfw, DC-Ops

Thu, Mar 21

RobH updated the task description for T360430: esams text cp nvme upgrade.
Thu, Mar 21, 1:40 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH added a comment to T360430: esams text cp nvme upgrade.

CS1553796 created. Will update once they confirm the window.

Thu, Mar 21, 1:40 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH added a comment to T360430: esams text cp nvme upgrade.

We would like remote hands to fetch shipment DEL0158639 which contains (8) 6.5TB NVMe PCIe SSDs from Dell NL to Wikimedia.

Proposed Work Window: 2023-03-27 @ 1100 CET

Once fetched, please unbox, photograph the contents and packing slip, and stage them for installation in our servers.

We would like to schedule the actual installation to take place on 2023-03-27 @ 1100 CET. The installation of these PCIe cards by remote hands should take anywhere from 1-3 hours.

We would like remote hands to unbox and install (1) PCIe NVMe SSD card into each of (8) of our hosts, which currently contain only (1) PCIe NVMe SSD. This will upgrade them to (2) PCIe NVMe SSDs per host.

This will be repeated for a total of eight hosts in our racks as follows:
hostname/serial Rack:U-space

cp3066/6QGW8X3 BW27:U2
cp3067/3QGW8X3 BY27:U2
cp3068/5QGW8X3 BW27:U3
cp3069/2QGW8X3 BY27:U3
cp3070/1QGW8X3 BW27:U4
cp3071/7QGW8X3 BY27:U4
cp3072/4QGW8X3 BW27:U5
cp3073/JPGW8X3 BY27:U5
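
As a rough illustration only, the host list above can be turned into a per-rack checklist. The sketch below is not part of the original request: the `HostSlot` type and the `parse_host_line` and `group_by_rack` helpers are hypothetical names, and the only assumption about the data is the `hostname/serial Rack:U-space` layout shown above.

```python
from collections import defaultdict
from typing import NamedTuple


class HostSlot(NamedTuple):
    """One row of the host list (hypothetical helper type)."""
    hostname: str
    serial: str
    rack: str
    unit: int


def parse_host_line(line: str) -> HostSlot:
    """Parse a single 'hostname/serial Rack:U-space' line."""
    host_serial, location = line.split()
    hostname, serial = host_serial.split("/")
    rack, unit = location.split(":U")
    return HostSlot(hostname, serial, rack, int(unit))


def group_by_rack(slots):
    """Group slots by rack, ordered by U-space within each rack."""
    racks = defaultdict(list)
    for slot in slots:
        racks[slot.rack].append(slot)
    for rack_slots in racks.values():
        rack_slots.sort(key=lambda s: s.unit)
    return dict(racks)


# Host list copied verbatim from the request above.
HOST_LIST = """\
cp3066/6QGW8X3 BW27:U2
cp3067/3QGW8X3 BY27:U2
cp3068/5QGW8X3 BW27:U3
cp3069/2QGW8X3 BY27:U3
cp3070/1QGW8X3 BW27:U4
cp3071/7QGW8X3 BY27:U4
cp3072/4QGW8X3 BW27:U5
cp3073/JPGW8X3 BY27:U5
"""

slots = [parse_host_line(line) for line in HOST_LIST.splitlines() if line.strip()]
by_rack = group_by_rack(slots)

# Print one checklist line per host, rack by rack.
for rack, rack_slots in sorted(by_rack.items()):
    for slot in rack_slots:
        print(f"{rack} U{slot.unit}: {slot.hostname} (serial {slot.serial})")
```

Grouping by rack is just one possible ordering; it keeps the four hosts in each rack together so they can be worked through in U-space order.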

We would prefer the cadence of the work to be as follows:

  • Unplug cables from listed host that has been powered down.
  • Note serial of the PCIe NVMe SSD and install it into the host.
  • Push host back into rack rails and re-attach all cables.
  • Power on host and update us remotely so we can begin testing the new hardware while remote hands works on the next one.

During the work window I'll be online remotely from the USA, so any updates can be sent via the ticket or via text message to +1.727.255.4597 or via email or google hangout to rhalsell@wikimedia.org.

Once the first host has the PCIe Card installed and cables re-attached, please update us so we can begin remote testing to ensure there are no issues while remote hands continues to install PCIe SSDs into the rest of the 8 total hosts.

Would remote hands please review the above directions for clarity or questions and also confirm the work window of 2023-03-27 @ 1100 CET.

Thank you in advance,

Thu, Mar 21, 1:18 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops

Mar 19 2024

RobH added a comment to T360430: esams text cp nvme upgrade.

Remote hands won't have any ability to power down a host other than by pressing the front power button. It would reduce potential complexity if we power down all the hosts for them to work on to prevent confusion. That way they know if it is powered off and matches the list, they can work on it.

Mar 19 2024, 3:33 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH updated the task description for T360430: esams text cp nvme upgrade.
Mar 19 2024, 3:32 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH added a comment to T355353: Q3:rack/setup/install dbprov100[56].

I will take care of it, as I discussed previously with John, but to avoid future mistakes, @RobH, is there a way to communicate the desired recipe more clearly? I wrote: "Partman recipe and/or desired Raid Level: db.cfg", and I can see how that is confusing because of the hostname. I'd like to make your team's workflow easier in any way I can. Any tips?

Mar 19 2024, 3:12 PM · Patch-For-Review, SRE, Data-Persistence, ops-eqiad, DC-Ops
RobH reassigned T355355: Q3:rack/setup/install dbprov200[56] from Jhancock.wm to jcrespo.

This installation is blocked until patchsets to allow installation are complete. I've removed the assignment from @Jhancock.wm to @jcrespo but once that is complete please assign it back to them.

Mar 19 2024, 2:55 PM · Patch-For-Review, SRE, Data-Persistence, ops-codfw, DC-Ops
RobH reassigned T355353: Q3:rack/setup/install dbprov100[56] from VRiley-WMF to jcrespo.

This installation is blocked until patchsets to allow installation are complete. I've removed the assignment from @VRiley-WMF to @jcrespo but once that is complete please assign it back to them.

Mar 19 2024, 2:55 PM · Patch-For-Review, SRE, Data-Persistence, ops-eqiad, DC-Ops
RobH added a comment to T355353: Q3:rack/setup/install dbprov100[56].

My patchset had mistakes, and @jcrespo has advised he is working on these patchsets. As such, I've abandoned my patchset.

Mar 19 2024, 2:54 PM · Patch-For-Review, SRE, Data-Persistence, ops-eqiad, DC-Ops
RobH added a comment to T360430: esams text cp nvme upgrade.

Chatted with @ssingh as I had neglected some items we had discussed previously:

Mar 19 2024, 2:24 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH claimed T360430: esams text cp nvme upgrade.
Mar 19 2024, 2:22 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH updated the task description for T360430: esams text cp nvme upgrade.
Mar 19 2024, 1:37 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH added a parent task for T360430: esams text cp nvme upgrade: Unknown Object (Task).
Mar 19 2024, 1:36 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops
RobH created T360430: esams text cp nvme upgrade.
Mar 19 2024, 1:36 PM · Patch-For-Review, SRE, Traffic, ops-esams, DC-Ops

Mar 18 2024

RobH updated subscribers of T360285: Netbox: mismatched device models: PowerEdge R450 - ConfigE-10G (netbox) != PowerEdge R650xs (puppetdb).

It appears when these were racked they had the incorrect model selected. There was already an R650 in Netbox (that could have been used) but I've also appended in https://netbox.wikimedia.org/dcim/device-types/288/

Mar 18 2024, 1:09 PM · SRE, DC-Ops, ops-codfw
RobH assigned T360285: Netbox: mismatched device models: PowerEdge R450 - ConfigE-10G (netbox) != PowerEdge R650xs (puppetdb) to Jhancock.wm.
Mar 18 2024, 1:08 PM · SRE, DC-Ops, ops-codfw

Mar 8 2024

RobH added a comment to T359632: install (2) 1.92TB SSDs from decom into prometheus100[56].

Awesome, post-offsite please coordinate with @fgiunchedi on when to install these. As the systems are hot swap, it shouldn't cause an issue, but I'd clear a window just in case.

Mar 8 2024, 5:28 PM · ops-eqiad, SRE, procurement
RobH updated the task description for T359632: install (2) 1.92TB SSDs from decom into prometheus100[56].
Mar 8 2024, 1:25 PM · ops-eqiad, SRE, procurement
RobH triaged T359632: install (2) 1.92TB SSDs from decom into prometheus100[56] as Medium priority.
Mar 8 2024, 1:25 PM · ops-eqiad, SRE, procurement
RobH triaged T359631: install (2) 1.92TB SSDs from decom into prometheus200[56] as Medium priority.
Mar 8 2024, 1:22 PM · ops-codfw, SRE

Mar 4 2024

RobH closed Unknown Object (Task), a subtask of T353424: Decommission Arelion's eqiad-codfw 10G link, as Resolved.
Mar 4 2024, 4:12 PM · SRE, ops-codfw, ops-eqiad

Mar 1 2024

RobH closed T358809: Netbox:Report:PhysicalHosts: mistmach model issue as Resolved.

Indeed, my bad! Fixed the incorrect entries and cleared 25 entries off the report.

Mar 1 2024, 2:00 PM · Infrastructure-Foundations, DC-Ops, netbox

Feb 29 2024

RobH added a comment to T358809: Netbox:Report:PhysicalHosts: mistmach model issue.

Is there a file where this mapping is maintained? If so, I can update it in the future when I add new device models to Netbox.

Feb 29 2024, 8:58 PM · Infrastructure-Foundations, DC-Ops, netbox
RobH created T358809: Netbox:Report:PhysicalHosts: mistmach model issue.
Feb 29 2024, 8:58 PM · Infrastructure-Foundations, DC-Ops, netbox

Feb 27 2024

RobH closed T358594: Remove IPV6 dns records from new database hosts as Resolved.

Cleanup completed, leaving the task open for DCOps to prevent this from happening.

Feb 27 2024, 3:48 PM · Data-Persistence, DC-Ops

Feb 26 2024

RobH added a comment to T358489: mw2420-mw2451 do have unnecessary raid controllers (configured).

I'm told there is a question on 'can we pull these raid controllers to use elsewhere' and the answer is 'no, or the host you remove it from has no controller.'

Feb 26 2024, 2:11 PM · SRE, serviceops
RobH added a comment to T358489: mw2420-mw2451 do have unnecessary raid controllers (configured).

Moritz asked me about this, and I have some background. So orders placed in January 2023 via the dell portal for standard configs also included a number of hosts with raid which should not have had raid.

Feb 26 2024, 2:09 PM · SRE, serviceops

Feb 22 2024

RobH changed the status of T353424: Decommission Arelion's eqiad-codfw 10G link from Open to Stalled.

Both disconnects are currently pending with the vendors. EQ's has a ticket submitted directly, whereas CyrusOne's is being handled via our account reps. Updates to both will take place on their individual tasks.

Feb 22 2024, 2:02 PM · SRE, ops-codfw, ops-eqiad
RobH changed the status of Unknown Object (Task), a subtask of T353424: Decommission Arelion's eqiad-codfw 10G link, from Open to Stalled.
Feb 22 2024, 2:00 PM · SRE, ops-codfw, ops-eqiad

Feb 20 2024

RobH closed T329219: Main Tracking Task for ESAMS Migration to KNAMS as Resolved.

Only two sub-tasks remain open, T350621 and T342239, both of which are being taken care of on their own tasks. As such, a master tracking task is no longer required and is being resolved.

Feb 20 2024, 3:31 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
RobH closed Unknown Object (Task), a subtask of T351304: FPC1 Failure on cr1-esams, as Resolved.
Feb 20 2024, 3:31 PM · netops, Infrastructure-Foundations, SRE
RobH closed Unknown Object (Task), a subtask of T329219: Main Tracking Task for ESAMS Migration to KNAMS, as Resolved.
Feb 20 2024, 3:29 PM · Patch-For-Review, SRE, ops-esams, DC-Ops
RobH closed Restricted Task, a subtask of T329219: Main Tracking Task for ESAMS Migration to KNAMS, as Resolved.
Feb 20 2024, 3:10 PM · Patch-For-Review, SRE, ops-esams, DC-Ops

Feb 13 2024

RobH renamed T357415: Q3:rack/setup/install ml-staging2003 from Q#:rack/setup/install ml-staging2003 to Q3:rack/setup/install ml-staging2003.
Feb 13 2024, 1:52 PM · SRE, Machine-Learning-Team, ops-codfw, DC-Ops
RobH assigned T357415: Q3:rack/setup/install ml-staging2003 to klausman.

Please review this racking task for the GPU test host we're ordering for codfw and provide the needed details for network, confirm hostname, etc...

Feb 13 2024, 1:51 PM · SRE, Machine-Learning-Team, ops-codfw, DC-Ops
RobH added a parent task for T357415: Q3:rack/setup/install ml-staging2003: Unknown Object (Task).
Feb 13 2024, 1:46 PM · SRE, Machine-Learning-Team, ops-codfw, DC-Ops
RobH created T357415: Q3:rack/setup/install ml-staging2003.
Feb 13 2024, 1:46 PM · SRE, Machine-Learning-Team, ops-codfw, DC-Ops

Jan 30 2024

RobH moved T356216: Q#:rack/setup/install (2) cloudbackup hosts from Backlog to Racking Tasks on the ops-codfw board.

@Andrew: I've assigned this task to you for you to populate the racking details, additionally please add the servers to the site.pp file with the insetup role and their partition info.

Jan 30 2024, 8:34 PM · Patch-For-Review, SRE, ops-codfw, cloud-services-team (Hardware), DC-Ops
RobH renamed T356216: Q#:rack/setup/install (2) cloudbackup hosts from Q#:rack/setup/install X to Q#:rack/setup/install (2) cloudbackup hosts.
Jan 30 2024, 8:33 PM · Patch-For-Review, SRE, ops-codfw, cloud-services-team (Hardware), DC-Ops
RobH created T356216: Q#:rack/setup/install (2) cloudbackup hosts.
Jan 30 2024, 8:33 PM · Patch-For-Review, SRE, ops-codfw, cloud-services-team (Hardware), DC-Ops

Jan 23 2024

RobH moved T355700: Q#:rack/setup/install logging-hd100[123] from Backlog to Racking Tasks on the ops-eqiad board.
Jan 23 2024, 5:47 PM · SRE, SRE Observability, ops-eqiad, DC-Ops
RobH added a parent task for T355700: Q#:rack/setup/install logging-hd100[123]: Unknown Object (Task).
Jan 23 2024, 5:47 PM · SRE, SRE Observability, ops-eqiad, DC-Ops
RobH created T355700: Q#:rack/setup/install logging-hd100[123].
Jan 23 2024, 5:47 PM · SRE, SRE Observability, ops-eqiad, DC-Ops

Jan 22 2024

RobH moved T355571: Q#:rack/setup/install an-redacteddb1001 from Backlog to Racking Tasks on the ops-eqiad board.
Jan 22 2024, 5:31 PM · Data-Platform-SRE (2024.02.12 - 2024.03.03), SRE, ops-eqiad, DC-Ops
RobH added a parent task for T355571: Q#:rack/setup/install an-redacteddb1001: Unknown Object (Task).
Jan 22 2024, 5:30 PM · Data-Platform-SRE (2024.02.12 - 2024.03.03), SRE, ops-eqiad, DC-Ops
RobH created T355571: Q#:rack/setup/install an-redacteddb1001.
Jan 22 2024, 5:30 PM · Data-Platform-SRE (2024.02.12 - 2024.03.03), SRE, ops-eqiad, DC-Ops

Jan 18 2024

RobH renamed T355353: Q3:rack/setup/install dbprov100[56] from Q#:rack/setup/install dbprov100[56] to Q3:rack/setup/install dbprov100[56].
Jan 18 2024, 7:23 PM · Patch-For-Review, SRE, Data-Persistence, ops-eqiad, DC-Ops
RobH moved T355355: Q3:rack/setup/install dbprov200[56] from Backlog to Racking Tasks on the ops-codfw board.
Jan 18 2024, 7:22 PM · Patch-For-Review, SRE, Data-Persistence, ops-codfw, DC-Ops
RobH added a parent task for T355355: Q3:rack/setup/install dbprov200[56]: Unknown Object (Task).
Jan 18 2024, 7:22 PM · Patch-For-Review, SRE, Data-Persistence, ops-codfw, DC-Ops
RobH created T355355: Q3:rack/setup/install dbprov200[56].
Jan 18 2024, 7:21 PM · Patch-For-Review, SRE, Data-Persistence, ops-codfw, DC-Ops
RobH moved T355353: Q3:rack/setup/install dbprov100[56] from Backlog to Racking Tasks on the ops-eqiad board.
Jan 18 2024, 7:20 PM · Patch-For-Review, SRE, Data-Persistence, ops-eqiad, DC-Ops
RobH added a parent task for T355353: Q3:rack/setup/install dbprov100[56]: Unknown Object (Task).
Jan 18 2024, 7:20 PM · Patch-For-Review, SRE, Data-Persistence, ops-eqiad, DC-Ops
RobH created T355353: Q3:rack/setup/install dbprov100[56].
Jan 18 2024, 7:19 PM · Patch-For-Review, SRE, Data-Persistence, ops-eqiad, DC-Ops
RobH moved T355350: Q#:rack/setup/install db2196-db2220 from Backlog to Racking Tasks on the ops-codfw board.
Jan 18 2024, 5:13 PM · Data-Persistence, SRE, ops-codfw, DC-Ops
RobH updated the task description for T355350: Q#:rack/setup/install db2196-db2220.
Jan 18 2024, 5:13 PM · Data-Persistence, SRE, ops-codfw, DC-Ops
RobH added a parent task for T355350: Q#:rack/setup/install db2196-db2220: Unknown Object (Task).
Jan 18 2024, 5:11 PM · Data-Persistence, SRE, ops-codfw, DC-Ops
RobH created T355350: Q#:rack/setup/install db2196-db2220.
Jan 18 2024, 5:11 PM · Data-Persistence, SRE, ops-codfw, DC-Ops

Jan 17 2024

RobH added a comment to T354732: cr2-codfw:FPC0 failure.

When I log in to cr2-codfw, I cannot see the serial of the line card in question:

Jan 17 2024, 10:00 PM · SRE, ops-codfw
RobH changed the edit policy for ops-magru.
Jan 17 2024, 5:25 PM
RobH created ops-magru.
Jan 17 2024, 5:24 PM
RobH edited Description on ops-esams.
Jan 17 2024, 5:24 PM

Jan 11 2024

RobH added a comment to T353424: Decommission Arelion's eqiad-codfw 10G link.

As the disconnects will reference contract IDs and disconnect fees, each site's disconnect has been put into its own S4 space subtask.

Jan 11 2024, 7:58 PM · SRE, ops-codfw, ops-eqiad
RobH added a subtask for T353424: Decommission Arelion's eqiad-codfw 10G link: Unknown Object (Task).
Jan 11 2024, 7:56 PM · SRE, ops-codfw, ops-eqiad
RobH added a subtask for T353424: Decommission Arelion's eqiad-codfw 10G link: Unknown Object (Task).
Jan 11 2024, 7:56 PM · SRE, ops-codfw, ops-eqiad
RobH added a comment to T353424: Decommission Arelion's eqiad-codfw 10G link.

@ayounsi: I never saw a blocker come up on this, so we're good to go ahead and disconnect the cross connections at each site for this, correct?

Jan 11 2024, 7:53 PM · SRE, ops-codfw, ops-eqiad
RobH moved T354896: Q3:rack/setup/install cloudcontrol2009-dev.codfw.wmnet from Backlog to Racking Tasks on the ops-codfw board.
Jan 11 2024, 6:53 PM · Patch-For-Review, SRE, ops-codfw, cloud-services-team (Hardware), DC-Ops
RobH added a parent task for T354896: Q3:rack/setup/install cloudcontrol2009-dev.codfw.wmnet: Unknown Object (Task).
Jan 11 2024, 6:53 PM · Patch-For-Review, SRE, ops-codfw, cloud-services-team (Hardware), DC-Ops
RobH created T354896: Q3:rack/setup/install cloudcontrol2009-dev.codfw.wmnet.
Jan 11 2024, 6:52 PM · Patch-For-Review, SRE, ops-codfw, cloud-services-team (Hardware), DC-Ops
RobH moved T354893: Q3:rack/setup/install restbase10[34-42] from Backlog to Racking Tasks on the ops-eqiad board.
Jan 11 2024, 6:26 PM · SRE, RESTBase, ops-eqiad, DC-Ops
RobH added a parent task for T354893: Q3:rack/setup/install restbase10[34-42]: Unknown Object (Task).
Jan 11 2024, 6:26 PM · SRE, RESTBase, ops-eqiad, DC-Ops