Monday, July 18, 2022

[fedora-arm] Re: Failures booting on Pi4

[root@fedora system]# [ 1907.572507] Unable to handle kernel paging request at virtual address fffffbfffe800064
[ 1907.581112] Mem abort info:
[ 1907.583955] ESR = 0x0000000096000006
[ 1907.587763] EC = 0x25: DABT (current EL), IL = 32 bits
[ 1907.593242] SET = 0, FnV = 0
[ 1907.596347] EA = 0, S1PTW = 0
[ 1907.599539] FSC = 0x06: level 2 translation fault
[ 1907.604542] Data abort info:
[ 1907.607472] ISV = 0, ISS = 0x00000006
[ 1907.611410] CM = 0, WnR = 0
[ 1907.614426] swapper pgtable: 4k pages, 48-bit VAs, pgdp=000000001d8d0000
[ 1907.621273] [fffffbfffe800064] pgd=000000001e6e1003, p4d=000000001e6e1003, pud=000000001e6e2003, pmd=0000000000000000
[ 1907.632326] Internal error: Oops: 96000006 [#1] SMP
[ 1907.637279] Modules linked in: ufs hfsplus hfs minix msdos jfs xfs tls snd_seq_dummy snd_hrtimer snd_seq rfcomm snd_seq_device nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv4 nf_reject_ipv6 nft_reject nft_ct nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nf_tables nfnetlink qrtr bnep bcm2835_v4l2(C) bcm2835_mmal_vchiq(C) videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_common videodev snd_soc_hdmi_codec snd_bcm2835(C) mc cpufreq_dt btsdio brcmfmac brcmutil vfat joydev fat hci_uart cfg80211 raspberrypi_cpufreq btqca btrtl btbcm btintel bluetooth rfkill vchiq(C) iproc_rng200 vc4 bcm2711_thermal snd_soc_core snd_compress ac97_bus snd_pcm_dmaengine leds_gpio fuse zram uas usb_storage snd_pcm broadcom dwc2 snd_timer crct10dif_ce raspberrypi_hwmon gpio_raspberrypi_exp bcm_phy_lib clk_bcm2711_dvp snd udc_core pwm_bcm2835 soundcore genet drm_display_helper bcm2835_wdt
[ 1907.637451] i2c_bcm2835 pcie_brcmstb cec mdio_bcm_unimac bcm2835_dma sdhci_iproc sdhci_pltfm sdhci phy_generic drm_cma_helper sunrpc scsi_dh_rdac scsi_dh_emc scsi_dh_alua dm_multipath aes_neon_bs
[ 1907.743334] CPU: 2 PID: 2979 Comm: kbdrate Tainted: G C -------- --- 5.19.0-0.rc6.20220714git4a57a8400075.49.fc37.aarch64 #1
[ 1907.756124] Hardware name: Unknown Unknown Product/Unknown Product, BIOS 2022.07-rc6 07/01/2022
[ 1907.764945] pstate: 80400005 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
[ 1907.772003] pc : logic_inb+0x80/0xb4
[ 1907.775631] lr : read_port+0x124/0x190
[ 1907.779428] sp : ffff800009c9baf0
[ 1907.782782] x29: ffff800009c9baf0 x28: ffff3c6aeaca4000 x27: 0000000000000000
[ 1907.790021] x26: 0000000000000000 x25: 0000ffffcc27e05f x24: ffff3c6aeaca4000
[ 1907.797258] x23: 0000ffffcc27e060 x22: 0000ffffffffffff x21: 00ffffcc27e05f00
[ 1907.804496] x20: 0000000000000064 x19: 0000000000000064 x18: 0000000000000000
[ 1907.811733] x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000
[ 1907.818970] x14: 0000000000000000 x13: 0000000000000000 x12: 0000000000000000
[ 1907.826206] x11: 0000000000000000 x10: 0000000000000000 x9 : ffffa9eca18a7dcc
[ 1907.833443] x8 : ffff3c6b0b388c28 x7 : 00000000e37a5742 x6 : 00000000637c0136
[ 1907.840680] x5 : 00ffffffffffffff x4 : 0000000000000000 x3 : 00000000000000c0
[ 1907.847917] x2 : 0000ffffcc27e05f x1 : 0000000000ffbffe x0 : fffffbfffe800064
[ 1907.855156] Call trace:
[ 1907.857629] logic_inb+0x80/0xb4
[ 1907.860898] read_port+0x124/0x190
[ 1907.864342] vfs_read+0xbc/0x1e4
[ 1907.867612] ksys_read+0x68/0xec
[ 1907.870879] __arm64_sys_read+0x28/0x34
[ 1907.874764] invoke_syscall+0x78/0x100
[ 1907.878560] el0_svc_common.constprop.0+0x68/0x124
[ 1907.883414] do_el0_svc+0x38/0x4c
[ 1907.886769] el0_svc+0x60/0x180
[ 1907.889951] el0t_64_sync_handler+0x11c/0x150
[ 1907.894365] el0t_64_sync+0x190/0x194
[ 1907.898076] Code: d2bfd000 f2df7fe0 f2ffffe0 8b000260 (39400000)
[ 1907.904254] ---[ end trace 0000000000000000 ]---

Message from syslogd@fedora at Jul 18 15:38:38 ...
kernel:Internal error: Oops: 96000006 [#1] SMP

Message from syslogd@fedora at Jul 18 15:38:39 ...
kernel:Code: d2bfd000 f2df7fe0 f2ffffe0 8b000260 (39400000)

On 7/18/22 03:26 PM, Chris Adams wrote:
> Once upon a time, Steven A. Falco <stevenfalco@gmail.com> said:
>> So, there is something bizarre happening with uSD cards with most of the images I tested. The only image that works for me on a uSD card is the Server image. Note that I tried several different uSD cards - Samsung, SanDisk, and a "no name" card, and got consistent results, so I don't think the uSD cards are at fault.
>
> If you take off "rhgb quiet" from the kernel command line, do you see
> kernel error messages? If I try to boot Fedora on a Pi 4B with any card
> in the uSD slot, I get a loop of these from the kernel:
>
> mmc0: error -22 whilst initialising SD card
> mmc0: invalid bus width

Interesting. I get an mmc0 error, but it looks different. I get this in a loop:

[ 51.973829] mmc0: ADMA error: 0x02000000
[ 51.982126] mmc0: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 51.993799] mmc0: sdhci: Sys addr: 0x00000000 | Version: 0x00001002
[ 52.005088] mmc0: sdhci: Blk size: 0x00007008 | Blk cnt: 0x00000001
[ 52.016654] mmc0: sdhci: Argument: 0x00000000 | Trn mode: 0x00000013
[ 52.028282] mmc0: sdhci: Present: 0x1fff0206 | Host ctl: 0x00000011
[ 52.039924] mmc0: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
[ 52.051544] mmc0: sdhci: Wake-up: 0x00000000 | Clock: 0x0000a707
[ 52.063216] mmc0: sdhci: Timeout: 0x00000000 | Int stat: 0x00000000
[ 52.074896] mmc0: sdhci: Int enab: 0x03ff100b | Sig enab: 0x03ff100b
[ 52.086568] mmc0: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000000
[ 52.098224] mmc0: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
[ 52.109936] mmc0: sdhci: Cmd: 0x0000333a | Max curr: 0x00080008
[ 52.121629] mmc0: sdhci: Resp[0]: 0x00000920 | Resp[1]: 0x00ee7f7f
[ 52.133368] mmc0: sdhci: Resp[2]: 0x325b5900 | Resp[3]: 0x00400e00
[ 52.145037] mmc0: sdhci: Host ctl2: 0x00000008
[ 52.154607] mmc0: sdhci: ADMA Err: 0x00000001 | ADMA Ptr: 0xe7000200
[ 52.166233] mmc0: sdhci: ============================================
[ 52.177916] mmc0: sdhci: e7000200: DMA 0xf4802000, LEN 0x0008, Attr=0x21
[ 52.189979] mmc0: sdhci: e7000208: DMA 0x00000000, LEN 0x0000, Attr=0x03
[ 52.203909] mmc0: error -5 whilst initialising SD card
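
A few fields in that dump can be decoded mechanically. The following is a minimal sketch, not a diagnosis. The bit positions are assumptions taken from the SD Host Controller ADMA2 descriptor layout and the Command register layout, and the helper names are hypothetical. It decodes the two descriptor attribute bytes, the ADMA error state, the command index, and the two errno values seen in this thread.

```python
import errno

# ADMA2 descriptor "Act" field, attribute bits 5:4 (assumed layout).
ACT = {0b00: "nop", 0b01: "reserved", 0b10: "tran", 0b11: "link"}

# ADMA Error Status register, state bits 1:0 (assumed layout).
ADMA_STATE = {0b00: "ST_STOP", 0b01: "ST_FDS (fetch descriptor)",
              0b10: "reserved", 0b11: "ST_TFR (transfer data)"}


def decode_adma2_attr(attr):
    """Split an ADMA2 descriptor attribute byte into its flags."""
    return {
        "valid": bool(attr & 0x1),
        "end": bool(attr & 0x2),
        "int": bool(attr & 0x4),
        "act": ACT[(attr >> 4) & 0x3],
    }


def decode_adma_err(value):
    """Decode the low bits of the 'ADMA Err' register value."""
    return {
        "state": ADMA_STATE[value & 0x3],
        "length_mismatch": bool(value & 0x4),
    }


def command_index(cmd_reg):
    """Command index lives in bits 13:8 of the Command register."""
    return (cmd_reg >> 8) & 0x3F


if __name__ == "__main__":
    # Values copied from the register dump above.
    print("desc 0:", decode_adma2_attr(0x21))
    print("desc 1:", decode_adma2_attr(0x03))
    print("ADMA Err:", decode_adma_err(0x00000001))
    print("Cmd index:", command_index(0x0000333A))
    # Kernel error codes are negated errno values.
    print("-22 ->", errno.errorcode[22], " -5 ->", errno.errorcode[5])
```

Read this way, the first descriptor is a valid 8-byte transfer and the second is a valid end marker, so the descriptor table itself looks well formed. The controller reports the error while fetching a descriptor. Command index 51 during card initialisation would usually be the 8-byte SCR read, though that interpretation is an assumption.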

> I'm not sure how you'd get different behavior with different images
> though, because they're all the same kernel/initrd setup AFAIK

They are mostly the same setup, but perhaps there are race conditions that are triggered by the different filesystems that the image types use? I know that Minimal uses ext4 for /, Workstation and KDE use btrfs, and Server uses LVM + XFS.

Since the USB flash sticks are well-behaved I'll use them for now. But this is one quirky piece of hardware, and I'm not surprised it has taken so many people so many hours/years to get it to work at all.

I haven't posted the following as a bug yet, but when I ssh into the unit as root I get an oops as shown in the attached file. But if I ssh into the unit as a normal user, there is no oops. The oops is non-fatal, as far as I can tell, but unsettling regardless.
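
For anyone reading the trace, here is a minimal sketch of how the ESR value and fault address in the oops break down. The field positions are assumptions based on the Arm ESR_ELx layout for data aborts, and `decode_esr` is a hypothetical helper, not a kernel function. The last lines subtract the value in x19 from the faulting address in x0. One plausible reading, not a confirmed diagnosis, is that kbdrate read I/O port 0x64 through /dev/port (the `read_port` and `logic_inb` frames) and the kernel's I/O port window at that base was not mapped.

```python
def decode_esr(esr):
    """Split an arm64 ESR value for a data abort into its fields.

    Bit positions are assumed from the Arm ESR_ELx layout.
    """
    iss = esr & 0x1FFFFFF          # bits 24:0
    fsc = iss & 0x3F               # fault status code, bits 5:0
    if 0x04 <= fsc <= 0x07:
        fsc_text = f"level {fsc - 0x04} translation fault"
    else:
        fsc_text = "other (not decoded in this sketch)"
    return {
        "EC": (esr >> 26) & 0x3F,              # exception class
        "IL_bits": 32 if (esr >> 25) & 1 else 16,
        "ISS": iss,
        "ISV": (iss >> 24) & 1,
        "FnV": (iss >> 10) & 1,
        "EA": (iss >> 9) & 1,
        "CM": (iss >> 8) & 1,
        "S1PTW": (iss >> 7) & 1,
        "WnR": (iss >> 6) & 1,                 # 0 = read, 1 = write
        "FSC": fsc,
        "FSC_text": fsc_text,
    }


# Values copied from the oops.
ESR = 0x96000006
FAULT_ADDR = 0xFFFFFBFFFE800064    # x0 at the faulting load
PORT = 0x64                        # x19 / x20

fields = decode_esr(ESR)
io_base = FAULT_ADDR - PORT        # base the port number was added to

if __name__ == "__main__":
    for name, value in fields.items():
        print(f"{name:8} = {value if isinstance(value, str) else hex(value)}")
    print("base     =", hex(io_base))
```

The decoded fields match what the kernel printed (EC 0x25, a read, level 2 translation fault), and the empty pmd entry in the page-table walk line is consistent with nothing being mapped at that address.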

Steve
