#649 configure iscsi/multipath device for node in IAD2
Closed: Fixed with Explanation by arrfab. Opened by arrfab.

This is the counterpart of ticket I created initially at Fedora side to have some storage for a CentOS server in IAD2 (instead of expanding local storage on the dell server) : https://pagure.io/fedora-infrastructure/issue/10486

Now that LUNs were created and internal ticket for vlan/network worked on, we just need to :

  • setup iscsi-initiator
  • configure multipath for device-mapper
  • setup lvm/vg/lv
  • migrate local storage to netapp

Metadata Update from @arrfab:
- Issue assigned to arrfab

Metadata Update from @arrfab:
- Issue tagged with: centos-common-infra, high-gain, medium-trouble

Multipath was configured successfully :

3600a098038314334463f504b5a372f4d dm-5 NETAPP  ,LUN C-Mode      
size=10T features='4 queue_if_no_path pg_init_retries 50 retain_attached_hw_handle' hwhandler='1 alua' wp=rw
|-+- policy='service-time 0' prio=50 status=active
| `- 21:0:0:1 sdf     8:80  active ready running
`-+- policy='service-time 0' prio=10 status=enabled
  `- 20:0:0:1 sdd     8:48  active ready running
3600a098038314334463f504b5a372f4c dm-4 NETAPP  ,LUN C-Mode      
size=10T features='4 queue_if_no_path pg_init_retries 50 retain_attached_hw_handle' hwhandler='1 alua' wp=rw
|-+- policy='service-time 0' prio=50 status=active
| `- 21:0:0:0 sde     8:64  active ready running
`-+- policy='service-time 0' prio=10 status=enabled
  `- 20:0:0:0 sdc     8:32  active ready running

and we have now also each mpath device a pv in a new vg

 PV                                            VG         Fmt  Attr PSize    PFree   
  /dev/mapper/3600a098038314334463f504b5a372f4c vg_netapp  lvm2 a--   <10.00t       0 
  /dev/mapper/3600a098038314334463f504b5a372f4d vg_netapp  lvm2 a--   <10.00t    2.39g

Problem is now that when trying to move data , it triggers a hardware reset of the server and from Dell iDrac logs we have this :

Correctable Machine Check Exception detected on CPU 2.
Multi-bit memory errors detected on a memory device at location(s) DIMM_B4

So we had to stop migrating data to netapp as it automatically triggers a os reset. We'll track this with Dell and see we can have that server fixed at hardware level

update/status : still waiting for Dell to fix the hardware issue on that server

Dell technician finally showed up , replaced HDD, memory module but also the network adapter (based on report he sent us) so now the machine doesn't boot correctly online, as it also had the deps on the remote iscsi block device ....

Server is now fully back online and with multipath working for the exposed iscsi luns on the netapp filer.
Closing this ticket and creating other one for the data migration

Metadata Update from @arrfab:
- Issue close_status updated to: Fixed with Explanation
- Issue status updated to: Closed (was: Open)

Log in to comment on this ticket.

Metadata