Open duindain opened 3 months ago
Hi @duindain, could you please try/tell:
In Docker, passing through any of these mappings works fine individually when the Frigate config only uses pcie:0:
- /dev/apex_0:/dev/apex_1
- /dev/apex_1:/dev/apex_0
- /dev/apex_1:/dev/apex_1
- /dev/apex_0:/dev/apex_0
If I set the Frigate config to use pcie:1, it fails.
I don't have a heatsink at the moment, but I can add one.
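Before comparing mappings, it can help to confirm which apex nodes the host actually exposes. The sketch below is only an illustration: the helper name `list_apex_devices` is hypothetical, and it assumes the driver creates nodes named `apex_*` under `/dev`, as the mappings above suggest.

```python
from pathlib import Path


def list_apex_devices(dev_dir="/dev"):
    """Return the sorted names of apex_* nodes found under dev_dir.

    dev_dir is a parameter so the function can be pointed at a test
    directory; on a real host the default /dev is assumed.
    """
    return sorted(p.name for p in Path(dev_dir).glob("apex_*"))


if __name__ == "__main__":
    found = list_apex_devices()
    # A dual TPU card would be expected to show two nodes here.
    print("apex nodes:", found if found else "none found")
```

If only one node is listed on the host, the problem sits below Docker and Frigate, and changing the mappings will not help.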
@duindain please try it with a heatsink, as it's needed anyway. If that doesn't help, we'll consider an adapter replacement.
I've put a passive heatsink on with a thermal pad. It's definitely not high quality, but the case is well ventilated, has a 120mm fan, and it's fairly cool here at the moment (14-20 C ambient).
I'm not sure if this is accurate or how you are meant to check (there didn't seem to be much info out there), but I get these values, both when passing through just apex_0 from Docker and when passing through both:
cat /sys/class/apex/apex_0/temp
48300 (varies within a range, so 46-48 degrees C)
cat /sys/class/apex/apex_1/temp
-89700 (this seems to always return this number)
I assume the -89700 is because it's not being used, or just not running?
I've tried a few combinations, but apex_1 always seems to return -89700 regardless.
The temp drops a bit when I configure Frigate to use both TPUs, presumably because it's spending all its time rebooting and not actually sending anything to be processed.
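The readings above can be checked with a short script. This is a minimal sketch, assuming the sysfs value is in millidegrees Celsius (which matches 48300 being read as roughly 48 degrees C) and assuming that anything outside -40 to 125 degrees C is suspect. That range and the function names are illustrative, not taken from the driver.

```python
from pathlib import Path

# Assumed plausible operating range in degrees C (illustrative, not from the driver).
PLAUSIBLE_RANGE_C = (-40.0, 125.0)


def millidegrees_to_celsius(raw):
    """Convert a raw sysfs string such as '48300\n' to degrees Celsius."""
    return int(raw.strip()) / 1000.0


def is_plausible(temp_c, bounds=PLAUSIBLE_RANGE_C):
    """Return True if temp_c falls inside the assumed plausible range."""
    low, high = bounds
    return low <= temp_c <= high


def read_apex_temp(index, sysfs_root="/sys/class/apex"):
    """Read the temperature of apex_<index> from sysfs, in degrees Celsius."""
    path = Path(sysfs_root) / f"apex_{index}" / "temp"
    return millidegrees_to_celsius(path.read_text())


if __name__ == "__main__":
    for index in (0, 1):
        try:
            temp_c = read_apex_temp(index)
        except (OSError, ValueError) as exc:
            print(f"apex_{index}: unreadable ({exc})")
            continue
        status = "ok" if is_plausible(temp_c) else "suspect reading"
        print(f"apex_{index}: {temp_c:.1f} C ({status})")
```

With the values reported above, apex_0 would be reported as ok and apex_1 as a suspect reading. A flagged value only says the number is outside the assumed range; it does not identify the cause.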
@duindain it feels like something's wrong with either the TPU card or the adapter itself. If you can't inspect the flip chips on your TPU card with a microscope or try another card, we can try replacing the adapter.
@magic-blue-smoke unfortunately the best I have is a magnifying lens, and I can't see anything that looks broken or badly soldered. I don't have another card to try.
@duindain we can try an adapter board replacement. Could you contact me using the contact form at the bottom of the page?
Thanks, I've sent a message with order details and other info.
I've received the new adapter. Unfortunately, the Coral is behaving the same as before, with one temperature sensor reporting an out-of-bounds value of -89700. If I enable both TPUs, Frigate continually crashes as before, but I can enable one fine.
I've sent an RMA request for the Coral. Is there anything else to try at this point?
Mouser rejected the warranty request
Hi,
Hoping someone can help diagnose this
I've bought an M.2 dual edge accelerator and an adapter from makerfab.
I've just got all the cameras working and Frigate running, and I'm getting constant reboots of Frigate saying it can't find one of the TPUs.
I'm running frigate in a docker container
dmesg looks like it's reporting an error from the adapter/accelerator, possibly?
frigate docker compose file
Frigate logs
If I comment out this line in the Frigate config
- /dev/apex_1:/dev/apex_1
and restart the Frigate container, it runs and stops rebooting, and dmesg stops reporting:
[ 115.671430] apex 0000:06:00.0: Error in device open cb: -110
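For anyone scanning dmesg output for this message, here is a small sketch that pulls out the PCI address and the error code. It assumes the code follows the usual kernel convention of a negated errno; on Linux, errno 110 is ETIMEDOUT, which would point to the device open timing out. The regular expression and function names are illustrative.

```python
import errno
import re

# Matches lines like:
# [ 115.671430] apex 0000:06:00.0: Error in device open cb: -110
APEX_ERROR = re.compile(
    r"apex (?P<device>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]): "
    r"Error in device open cb: (?P<code>-?\d+)"
)


def parse_apex_error(line):
    """Return (pci_address, error_code) for a matching line, else None."""
    match = APEX_ERROR.search(line)
    if match is None:
        return None
    return match.group("device"), int(match.group("code"))


def errno_name(code):
    """Map a negated errno to its symbolic name on the current platform."""
    return errno.errorcode.get(abs(code), "unknown")


if __name__ == "__main__":
    sample = "[ 115.671430] apex 0000:06:00.0: Error in device open cb: -110"
    parsed = parse_apex_error(sample)
    if parsed is not None:
        device, code = parsed
        print(f"{device}: error {code} ({errno_name(code)})")
```

The symbolic name depends on the platform running the script, so the lookup is only meaningful on the same kind of system that produced the log.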
I've removed the adapter, checked that it's seated well and free of dust, and reinserted it into the PCI slot.
CPU: Ryzen 7 5700G
Motherboard: B550M Steel Legend
GPU: Onboard
OS: Linux Mint 21.3 Virginia