Vadimp merge v.7.0040.3000 br #1494

Open. Wants to merge 97 commits into base: master.


vadimp-nvidia
Collaborator

No description provided.

sholeksandr and others added 30 commits October 10, 2024 10:19
…rm driver

1. Add N5112 (HI169) support to mlx-platform driver
2. Change LED name led_fault -> led_status (Bug: 3896626)
3. Fix platform power_down for n5xxxn systems
4. Align FAN divider/pulses for n5xxxn to new CPLD version
5. Fix LED colors for N51XX platform. Remove: "power:green" (not supported by CPLD), add: "status:green"
6. Change graceful reset flow for N51XX
  6.1. Change CPLD reg power_button -> graseful_pwr_off
  6.2. hw-mgmt sync service: change poll attribute power_button -> graseful_pwr_off
       disable "GracefulShutdown" redfish request to BMC

Signed-off-by: Oleksandr Shamray <[email protected]>
Add n5112_LD TTM support HW_ID:169, SKU:HI169, VMOD0021

Signed-off-by: Oleksandr Shamray <[email protected]>
Due to a CPLD change, align the FAN drawer count to 4 for the 5112_LD PO
system

Signed-off-by: Oleksandr Shamray <[email protected]>
Fix n51xxd_sensors.conf name:

n51xxd_sensors.conf -> n51xxld_sensors.conf

Bug: 4051715

Signed-off-by: Oleksandr Shamray <[email protected]>
Add drivetemp thermal sensor support to n51xx ui interface

Signed-off-by: Oleksandr Shamray <[email protected]>
Change N5110 thermal sensor name: fan_amb -> mng_amb

Bug: 4051715

Signed-off-by: Oleksandr Shamray <[email protected]>
Fix attribute sync for the "module" sensor. If a module exists but its temperature attr
is not readable, an empty temperature attribute will be created in the
{hw-mgmt}/temperature folder.

Bug: 3965817

Signed-off-by: Oleksandr Shamray <[email protected]>
Fix hw-management-sync crash on BMC redfish fail

Bug: 4075088

Signed-off-by: Oleksandr Shamray <[email protected]>
Add n5101_LD MSFT support HW_ID:170, SKU:HI170, VMOD0021

Signed-off-by: Oleksandr Shamray <[email protected]>
Starting from kernel 5.12-rc1, pmbus reads are always
forced down to the chip and get the values directly. The driver
doesn't cache the values anymore.

This behaviour change has caused problems in Delta-1.1K
PSUs, on which pmbus registers were not written.

This patch introduces the driver caching.

Bugs# 3874682

Signed-off-by: Ciju Rajan K <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
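
The commit above concerns a kernel pmbus driver, so the actual patch is not Python. Purely as an illustration of the caching idea it describes, here is a minimal sketch in which a value read from the chip is reused until it is older than a chosen age. The class name `CachedRegisterReader`, the `read_fn` callback and the 1-second `max_age` default are assumptions made for this example, not taken from the patch.

```python
import time


class CachedRegisterReader:
    """Reuse a previously read register value until it becomes stale.

    Hypothetical helper for illustration only; not the driver's code.
    """

    def __init__(self, read_fn, max_age=1.0, clock=time.monotonic):
        self._read_fn = read_fn    # function that really reads the chip
        self._max_age = max_age    # assumed cache lifetime, in seconds
        self._clock = clock        # injectable clock, useful for testing
        self._cache = {}           # register -> (timestamp, value)

    def read(self, register):
        now = self._clock()
        entry = self._cache.get(register)
        if entry is not None and now - entry[0] < self._max_age:
            # Fresh enough: return the cached value without touching the chip.
            return entry[1]
        value = self._read_fn(register)
        self._cache[register] = (now, value)
        return value
```

With this shape, repeated reads of the same register within `max_age` reach the hardware only once.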
For systems running without the minimal driver (with SDK only), add a
new flag "minimal_usupported". This flag can be added to the system definition
function in hw-management.sh

minimal_usupported=1

Signed-off-by: Oleksandr Shamray <[email protected]>
Add sync for asics_init_done attribute

Signed-off-by: Oleksandr Shamray <[email protected]>
1. Add platform support for sn5640
2. Add "shutdown_unlock" attribute
3. Rebase downstream patches

Signed-off-by: Oleksandr Shamray <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
Add sync for asics_init_done attribute

Signed-off-by: Oleksandr Shamray <[email protected]>
Fixed voltmon sensor type for sn5640 system

Signed-off-by: Oleksandr Shamray <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
Add ALT "xdpe1a2g7" for voltmon components in SN5640

Signed-off-by: Oleksandr Shamray <[email protected]>
2. Remove UI support since it is supported only for IB

Signed-off-by: Oleksandr Shamray <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
Fix ASIC bus id address for SN5640 switch

Signed-off-by: Oleksandr Shamray <[email protected]>
…l driver

Fix initialisation for systems with disabled minimal driver

Signed-off-by: Oleksandr Shamray <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
If an asic/module sensor file already exists and is a soft link, skip
updating it

Signed-off-by: Oleksandr Shamray <[email protected]>
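
As a rough illustration of the soft-link check described in the commit above, the sketch below refuses to overwrite a path that is a symbolic link and writes the value otherwise. The function name `update_sensor_file` and its return convention are hypothetical and are not taken from the hw-management sync script.

```python
import os
import tempfile


def update_sensor_file(path, value):
    """Write value to path unless path is a soft link.

    Returns True if the file was written, False if it was skipped.
    Hypothetical helper for illustration only.
    """
    if os.path.islink(path):
        # Another component owns this link; leave it untouched.
        return False
    with open(path, "w", encoding="utf-8") as f:
        f.write(f"{value}\n")
    return True


if __name__ == "__main__":
    workdir = tempfile.mkdtemp()
    real = os.path.join(workdir, "asic")
    link = os.path.join(workdir, "asic1")
    print(update_sensor_file(real, 42000))  # regular file: written
    os.symlink(real, link)
    print(update_sensor_file(link, 0))      # soft link: skipped
```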
sholeksandr and others added 25 commits November 25, 2024 08:52
Fix missing endline character in ./thermal/asic1.
Added '\n' at end of file

Bug SW #4172888

Signed-off-by: Oleksandr Shamray <[email protected]>
Add devtree-based component monitoring to TC.
Supported components: voltmonX, power_convX

Signed-off-by: Oleksandr Shamray <[email protected]>
Fix log message "Write PSU1 PWM 80". Log it only when the PSU is
present. Previously it was printed even if the PSU was out.
This issue is related only to the TC logging system.

Signed-off-by: Oleksandr Shamray <[email protected]>
Update redfish client get_pass_login function: sync with NVOS code

Bug: 4176442

Signed-off-by: Oleksandr Shamray <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
Add alternative UI labels for PDB BOM for Juliet NSO

Signed-off-by: Oleksandr Shamray <[email protected]>
Change moduleX temperature input attribute name:

moduleXX -> moduleXX_temp_input

Bug: 4198371

Signed-off-by: Oleksandr Shamray <[email protected]>
Add support for temp reading for module[64..65]

Bug: 4198382

Signed-off-by: Oleksandr Shamray <[email protected]>
Signed-off-by: Oleksandr Shamray <[email protected]>
Fix hw-management sync crash in case the ASIC value does not exist

Bug: 4184342

Signed-off-by: Oleksandr Shamray <[email protected]>
Add Thermal config for SN5610

Bug: 4200580

Signed-off-by: Oleksandr Shamray <[email protected]>
On some PSUs the output voltage value is exposed in the vout2 pmbus attribute (instead of
vout). This fix adds correct vout attribute linkage on PSUs which
have both vout/vout2 attributes

Bug: 4143199

Signed-off-by: Oleksandr Shamray <[email protected]>
Shift I2C buses left by two positions.
Add two additional muxes.
Add static topology for cartridges per system hardware ID.

Parsed output example:
cat /var/run/hw-management/eeprom/cable_cartridge4_eeprom_data
FRU Inventory From File:
/var/run/hw-management/eeprom/cable_cartridge4_eeprom

  FRU Chassis Type: Blade Enclosure
  FRU Chassis Part Number: 01-QM3-LBC-NO-NVPN
  FRU Chassis Serial Number: 999WWYY123456
  FRU Chassis Custom Info: 02h 00h 01h 00h 00h 00h 00h 00h

  FRU Board Manufacturing Date/Time: 06/07/24 - 03:14:00
  FRU Board Manufacturer: Nvidia
  FRU Board Product Name: 1-Slot GB LBC
  FRU Board Serial Number: 1782224000045
  FRU Board Part Number: HS-32836-001
  FRU FRU File ID: HS32836-001.1
  FRU Board Custom Info: APH HS-32836-001

  FRU Product Name: 1-Slot GB LBC
  FRU Product Part/Model Number: HS-32836-001
  FRU Product Version: TS2
  FRU Product Serial Number: 1782224000045
  FRU Product Asset Tag: 0000000000000000
  FRU FRU File ID: HS32836-001.1

Signed-off-by: Vadim Pasternak <[email protected]>
Update the hw_management sync script to fix module temperature reading
when in SW mode. Key changes:

1. Add a check for control mode (SW or FW) before reading. Skip reading
in SW mode.
2. Add validation for module presence.
3. Run scan and update for all modules in a single loop instead of
updating each module separately.

Bug: 4201043

Signed-off-by: Oleksandr Shamray <[email protected]>
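
The three changes listed in the commit above can be pictured as one pass over all modules. The following is a minimal sketch under stated assumptions: the `modules` dictionary layout, the `"SW"`/`"FW"` strings and the `read_temp` callback are invented for this example and do not reflect the real sync script's data structures.

```python
def sync_module_temps(modules, read_temp):
    """Scan all modules in a single loop and collect readable temperatures.

    modules:   dict of name -> {"control": "SW" or "FW", "present": bool}
               (hypothetical layout, for illustration only)
    read_temp: callable taking a module name and returning its temperature
    """
    temps = {}
    for name, state in modules.items():
        if state["control"] == "SW":
            # SW control mode: reading is skipped.
            continue
        if not state["present"]:
            # Module is not plugged in: nothing to read.
            continue
        temps[name] = read_temp(name)
    return temps


if __name__ == "__main__":
    modules = {
        "module1": {"control": "FW", "present": True},
        "module2": {"control": "SW", "present": True},
        "module3": {"control": "FW", "present": False},
    }
    print(sync_module_temps(modules, lambda name: 40000))
```

Only `module1` is read in this usage example, because `module2` is in SW mode and `module3` is absent.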
…al driver

Fix asic_chipup_completed for systems without the minimal driver. Added a
feature to the sync script which handles ASIC state and updates
attributes/counters:

- asics_init_done
- asic_chipup_completed

Bug: 34304249

Signed-off-by: Oleksandr Shamray <[email protected]>
Add rules for plug-in/plug-out of the USB net interface connecting
the CPU host and BMC.

Bug: 4146744

Signed-off-by: Vadim Pasternak <[email protected]>
Signed-off-by: Felix Radensky <[email protected]>
Disable the COME reset causes below, since they arrive as an
additional second cause: "reset_sw_reset", "reset_from_carrier",
"reset_aux_pwr_or_reload", "reset_platform", "reset_pwr".

As a result, two causes are reported, while the expectation is to provide only
one.

Disable these COME reset causes to avoid duplication.

Signed-off-by: Vadim Pasternak <[email protected]>
6 participants