Open mt-mrx opened 4 years ago
FYI, I had a similar issue, but in my case it was caused by a corrupted database (after the system spent a few hours at 100% disk usage):
influx_inspect buildtsi
shows exactly which database is corrupt:
2020-09-15T09:10:55.745251Z info Rebuilding database {"log_id": "0PGGLcyW000", "name": "gitlab"}
unexpected fault address 0x7f635f6a9071
fatal error: fault
[signal SIGBUS: bus error code=0x2 addr=0x7f635f6a9071 pc=0x90ac55]
goroutine 1 [running]:
runtime.throw(0xbc40cc, 0x5)
/usr/local/go/src/runtime/panic.go:774 +0x72 fp=0xc000113468 sp=0xc000113438 pc=0x42f8a2
runtime.sigpanic()
/usr/local/go/src/runtime/signal_unix.go:391 +0x455 fp=0xc000113498 sp=0xc000113468 pc=0x443e85
github.com/influxdata/influxdb/tsdb.ReadSeriesEntry(0x7f635f6a9071, 0x155f8f, 0x155f8f, 0x7f635f6a8fdf, 0x92, 0x156021, 0x0, 0x0, 0x9b)
/go/src/github.com/influxdata/influxdb/tsdb/series_segment.go:417 +0x35 fp=0xc0001134f8 sp=0xc000113498 pc=0x90ac55
github.com/influxdata/influxdb/tsdb.(*SeriesSegment).ForEachEntry(0xc0002c00a0, 0xc000113588, 0x1, 0x0)
/go/src/github.com/influxdata/influxdb/tsdb/series_segment.go:244 +0x8b fp=0xc000113560 sp=0xc0001134f8 pc=0x90a11b
github.com/influxdata/influxdb/tsdb.(*SeriesSegment).MaxSeriesID(0xc0002c00a0, 0x5)
/go/src/github.com/influxdata/influxdb/tsdb/series_segment.go:232 +0x5c fp=0xc0001135a8 sp=0xc000113560 pc=0x90a06c
github.com/influxdata/influxdb/tsdb.(*SeriesPartition).openSegments(0xc0003b2210, 0x0, 0xc0001136f0)
/go/src/github.com/influxdata/influxdb/tsdb/series_partition.go:124 +0x150 fp=0xc000113690 sp=0xc0001135a8 pc=0x905b90
github.com/influxdata/influxdb/tsdb.(*SeriesPartition).Open.func1(0xc0003b2210, 0x26, 0x1ff)
/go/src/github.com/influxdata/influxdb/tsdb/series_partition.go:78 +0x2f fp=0xc000113700 sp=0xc000113690 pc=0x9101af
github.com/influxdata/influxdb/tsdb.(*SeriesPartition).Open(0xc0003b2210, 0xc000316380, 0x1)
/go/src/github.com/influxdata/influxdb/tsdb/series_partition.go:95 +0xa7 fp=0xc000113758 sp=0xc000113700 pc=0x905987
github.com/influxdata/influxdb/tsdb.(*SeriesFile).Open(0xc0002c0000, 0x0, 0x0)
/go/src/github.com/influxdata/influxdb/tsdb/series_file.go:91 +0x367 fp=0xc000113a78 sp=0xc000113758 pc=0x9006b7
github.com/influxdata/influxdb/cmd/influx_inspect/buildtsi.(*Command).processDatabase(0xc000116d80, 0xc00003a8b5, 0x6, 0xc0003101a0, 0x1b, 0xc0003101c0, 0x1a, 0x0, 0x0)
/go/src/github.com/influxdata/influxdb/cmd/influx_inspect/buildtsi/buildtsi.go:257 +0x250 fp=0xc000113c68 sp=0xc000113a78 pc=0xa29a50
github.com/influxdata/influxdb/cmd/influx_inspect/buildtsi.(*Command).run(0xc000116d80, 0x7ffd3bb9a83b, 0x14, 0x7ffd3bb9a858, 0x13, 0x0, 0x7)
/go/src/github.com/influxdata/influxdb/cmd/influx_inspect/buildtsi/buildtsi.go:126 +0x3bb fp=0xc000113d90 sp=0xc000113c68 pc=0xa280db
github.com/influxdata/influxdb/cmd/influx_inspect/buildtsi.(*Command).Run(0xc000116d80, 0xc000032200, 0x4, 0x4, 0x8, 0xc000032200)
/go/src/github.com/influxdata/influxdb/cmd/influx_inspect/buildtsi/buildtsi.go:81 +0x55c fp=0xc000113df8 sp=0xc000113d90 pc=0xa27c8c
main.(*Main).Run(0xc000113f18, 0xc000032200, 0x4, 0x4, 0x0, 0x4)
/go/src/github.com/influxdata/influxdb/cmd/influx_inspect/main.go:93 +0xdc5 fp=0xc000113ec0 sp=0xc000113df8 pc=0xa56385
main.main()
/go/src/github.com/influxdata/influxdb/cmd/influx_inspect/main.go:28 +0x151 fp=0xc000113f60 sp=0xc000113ec0 pc=0xa554f1
runtime.main()
/usr/local/go/src/runtime/proc.go:203 +0x21e fp=0xc000113fe0 sp=0xc000113f60 pc=0x43123e
runtime.goexit()
/usr/local/go/src/runtime/asm_amd64.s:1357 +0x1 fp=0xc000113fe8 sp=0xc000113fe0 pc=0x45bd51
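The database being processed when the fault hits is the one named in the last "Rebuilding database" line before the trace. Here is a minimal sketch for pulling that name out of saved buildtsi output, assuming the log format matches the line shown above. The function name last_rebuilt_db and the log file name in the usage note are hypothetical, not part of InfluxDB.

```shell
#!/bin/sh
# Print the database named in the last "Rebuilding database" line of a
# saved influx_inspect buildtsi log. Assumes the log line format shown
# in the output above; last_rebuilt_db is a hypothetical helper name.
last_rebuilt_db() {
    grep 'Rebuilding database' "$1" \
        | sed -n 's/.*"name": *"\([^"]*\)".*/\1/p' \
        | tail -n 1
}
```

To use it, redirect both stdout and stderr of the buildtsi run into a file (for example buildtsi.log) and call last_rebuilt_db buildtsi.log; with the output above it prints gitlab.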
I had a similar problem with a huge database taking up all free disk space. After freeing up space, influxd and influx_inspect both failed with the "unexpected fault address" panic.
I deleted the _series folders of the corrupted databases and then ran influx_inspect buildtsi to rebuild the index, and it works again. 🎉
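A slightly more cautious variant of this is to move the _series folder aside instead of deleting it, so it can be put back if the rebuild goes wrong. The sketch below assumes the layout <data_dir>/<database>/_series and that influxd is stopped; the function name set_aside_series and the backup location are hypothetical. The backup directory should be outside the InfluxDB data directory.

```shell
#!/bin/sh
# Move one database's _series directory into a backup location instead
# of deleting it. Assumes the layout <data_dir>/<db>/_series.
# set_aside_series is a hypothetical helper name; run it only while
# influxd is stopped.
set_aside_series() {
    data_dir=$1
    db=$2
    backup_dir=$3
    src="$data_dir/$db/_series"
    dest="$backup_dir/$db/_series"

    if [ ! -d "$src" ]; then
        echo "no _series directory at $src" >&2
        return 1
    fi
    if [ -e "$dest" ]; then
        echo "backup already exists at $dest" >&2
        return 1
    fi

    mkdir -p "$backup_dir/$db" || return 1
    # Print the new location so the caller can record it.
    mv "$src" "$dest" && echo "$dest"
}
```

For example, set_aside_series /var/lib/influxdb/data gitlab /var/backups/influxdb, followed by influx_inspect buildtsi -datadir /var/lib/influxdb/data -waldir /var/lib/influxdb/wal. The paths here are assumptions (the usual package defaults) and need to be adjusted to the actual installation.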
Hi,
I upgraded my influxdb:1.7.10 docker container to 1.8.0 and migrated to INFLUXDB_DATA_INDEX_VERSION=tsi1, which was successful. I'm not sure whether the upgrade has anything to do with the segfault of influx_inspect, so I'm adding my upgrade steps as well.
Afterwards I wanted to use "influx_inspect buildtsi -compact-series-file ..." but it crashes with a "fatal error: fault". I tried starting the container with those data files, but the container just restarts in a loop and never comes up.
Environment info: My docker environment runs on a Raspberry Pi 4 with the data stored on an external SSD.
When the influxdb container is running, there is enough free memory available.
Steps to reproduce:
Successful upgrade 1.7.10 to 1.8.0 steps
I read https://docs.influxdata.com/influxdb/v1.8/administration/upgrading/ and decided to upgrade my container from 1.7.10 to 1.8.0 and also use the tsi1 index.
My starting docker-compose definition was:
I had no index files as mentioned in 3c) of the upgrade instructions, so my understanding was that I didn't need to prepare the data in any way.
My config file/container environment variables contained no index-version setting as mentioned in 3a).
I then stopped the container and backed up the data files.
Started a temporary container to build the index.
Inside the container I ran "influx_inspect buildtsi ...", which ran fine, printed only "info" messages and, as far as I can tell, reported no errors.
Modified docker-compose definition from above with the following values
Started container
Checked my grafana dashboards afterwards and the data from last year for all sensors looked fine.
Failed steps to compact-series in 1.8.0
Stopped the 1.8.0 container
Created a temporary container with influxdb and access to the data files
Ran the compact-series command as described here: https://docs.influxdata.com/influxdb/v1.8/administration/compact-series-file/ You can see the result below.