All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
1.17.0 - 2024-01-16
This release introduces the new memfaultctl write-metrics
command
as well as some important bugfixes.
memfaultctl write-metrics
, which allows users to write metrics tomemfaultd
from a script or shell by specifying them as arguments to the command in the formKEY=VALUE
.- 2 new built-in system metrics under the new
diskstats
metric category. These metrics follow the naming patterndiskstats/<device name>/reads_per_second
anddiskstats/<device name>/writes_per_second
where<device name>
is the name of a device listed in/proc/diskstats
. The list of devices monitored can be configured with themetrics.system_metric_collection.diskstats
configuration field.
- The log message emitted when the MAR cleaner
encounters an invalid MAR entry in the MAR
staging area has been lowered from
WARN
level toDEBUG
level. - A log message emitted when the configured
high_resolution_telemetry.max_samples_per_minute
rate limit is violated has been lowered fromWARN
level toDEBUG
. This is to avoid repeatedly logging the same message at a high frequency when the system is sending more readings than permitted.
- All deltas calculated based on a current and previous counter from procfs now take potential overflow into account.
- Fixed a sequencing issue with the cleaning of the MAR directory concerning an edge case where logs would fail to recover on devices that are near their disk space or inode quotas. This would result in stranded log files that would not get deleted or uploaded to Memfault until there is enough space to recover them.
1.16.1 - 2024-01-06
This is a small release that adds coredump capture support for 32-bit ARM targets using musl libc.
- The
coredump
feature can now be compiled for thearmv7-unknown-linux-musleabihf
target.
1.16.0 - 2024-12-20
This release includes two major new features:
- A new coredump capture mode to process the coredumps directly on device resulting in even smaller crash signature and guaranteeing that no user data is uploaded;
- High Resolution Telemetry for Linux, storing every single datapoint and enabling second-by-second data from Linux devices, for even deeper debugging capability.
- Built-in wireless metric capture - wireless interfaces being monitored by the
metrics.system_metric_collection.network_interfaces
configuration will now have RSSI captured alongside the existing network metrics - Added a
boottime_duration_ms
to Metric Reports that indicates the duration of the report independent of whether the CPU was running or not. This is intended to be used for debugging purposes for teams whose devices sleep often while in use. - The new on-device stack unwinding option for capturing crash Traces, which can
be enabled by setting
coredump.capture_strategy.type
tostacktrace
inmemfaultd.conf
. - High resolution telemetry, which is enabled by default and can be disabled or
have its rate limiting configured with the
metrics.high_resolution_telemetry
field. - Support for Yocto Scarthgap in
meta-memfault
. This support will be extended tometa-memfault-example
in an upcoming release. Check out thescarthgap
branch if you are using that version of Yocto! - The built-in system memory metrics now include additional states such as Cached and Buffered.
- The
metrics.statsd_server
has had a new option,legacy_gauge_aggregation
, added that averages Gauge metric readings rather than simply storing the most recent value. This is meant to allow for a smooth migration fromcollectd
's StatsD server tomemfaultd
's. New integrations should use theh
(Histogram) metric type for metrics whose readings should be aggregated via average.
- The default on-device rate limit for logs ingested by
memfaultd
has been increased from 500 lines per minute to 1000 lines per minute
- Arbitrary ASCII strings between 1 and 128 in length can now be used in StatsD
metric keys (besides the
:
delimiter), matching the behavior described in the docs.
Some built-in metrics were removed, as the naming conventions were not consistent. All of these metrics were added in 1.15.0. The list can be found below:
cpu_usage_pct
connectivity_recv_bytes
connectivity_sent_bytes
connectivity_<interface>_recv_bytes
connectivity_<interface>_sent_bytes
memory_used_pct
memory_<process>_pct
cpu_usage_<process>_pct
storage_used_<disk>_pct
1.15.1 - 2024-10-21
This is a patch release to fix a bug in our release process that caused the
Cargo lock file to get out of sync with our other published crates. This only
affected users building with --locked
. No other changes were made.
- Fixed a bug in our release process that allowed the Cargo lock file to get out
of sync with our other published crates. This caused a build failure when
building with
--locked
.
1.15.0 - 2024-09-30
- Add a new per-process
operational_crashes
metric. This is similar to the system wideoperational_crashes
metric, but is broken down by service. The metric will be namedoperational_crashes_<process name>
, where<process name>
is the name of the executable binary for the crashed process. - Add a builtin log level extractor for all logs that are ingested by
memfaultd
. This allows you to define regex for each log level so that the level reported to Memfault matches the level inside the log message. This is useful for logs fromsystemd
for example that do not always have a matching level in the metadata and the message. - We've added a host of new builtin metrics:
cpu_usage_<process>_pct
- Percent CPU usage of a specific process.memory_<process>_pct
- Percent memory usage of a specific process.storage_used_<disk>_pct
- Percent disk usage of a specific disk.connectivity_<sent/recv>_bytes
- Bytes sent and received over a network interface.connectivity_<interface>_<sent/recv>_bytes
- Bytes sent and received over a specific network interface.
- The builtin system metric configuration is now enabled by default. This means
that
memfaultd
will now collect metrics on CPU, memory, disk usage, and much more out of the box. Note that this means some metrics previously reported viacollectd
will now have different keys. We hope this will make it more clear what each metric actually represents on a device - if this causes issues with your Project configuration please don't hesitate to reach out to us for support! memfaultd
's internalstatsd
server is now enabled by default. This allows you to send custom metrics tomemfaultd
using the StatsD protocol. without having the first route them throughcollectd
.- We've made some changes to increase performance in our log processing pipeline. This should result in a lower CPU usage when processing logs.
- Previously a small number of log messages written by
memfaultd
did not have a level. We've updated them to now have a level matching the severity of each message. - The
journald
log source is now the default log collector. This will use our internal processor for grabbing logs directly from the systemd journal. If you are not using systemd, you can still use thefluent-bit
log source.
- Fixed a bug where leading and trailing whitespace was being added to attribute
names when using
memfaultctl write-attributes
. - Previously
memfaultd
would panic if a MAR attachment was not an absolute path or was not a file. This has been changed to simply log an error and continue. - Fixed a bug in which some metrics captured before enabling data collection were uploadedto Memfault once data collection was enabled. No data captured on devices for which data collection was always disabled would be uploaded as a result of this bug.
1.14.0 - 2024-08-21
This is a minor release consisting mostly of refactors, but we have also added
some cool new features! We have added support for custom data recordings, which
allows you to record arbitrary data on your device and have it uploaded to
Memfault. We have also added support for capturing the contents of the pstore
file system as a custom data recording whenever there is a kernel panic. This
will allow you to inspect any logs or data that were present at the time of the
panic.
- Support for custom data recordings has been added! This feature allows you to record arbitrary data on your device and have it uploaded to Memfault. This is useful for data that doesn't fit into the existing metrics or logs categories.
memfaultd
will now capture the contents of thepstore
file system as a custom data recording whenever there is a kernel panic. It will be uploaded as a custom data recording namedpstore
. This feature is disabled by default and can be enabled with thereboot.capture_pstore
configuration option.- A new core metric
memory_pct
has been added to track the memory utilization of the device. This metric represents the percentage of memory used by the device at the time of the metric report.
- The
libmemfaultc
C library was moved under thememfaultc-sys
crate directory, enabling publishing of thememfaultd
crate. - Allow
-
characters in statsd metric names that are ingested by the builtin StatsD server. - We've started a refactor of the
memfaultd
codebase to centralize our usage of concurrency primitives. This will make the codebase more maintainable and easier to reason about. You can find this new concurrency framework in thessf
crate. - A host of internal changes to allow us to publish to crates.io! Look forward
to seeing
memfaultd
on crates.io soon! - Removed per core builtin cpu metrics. These were not very useful and were creating a lot of noise in the timeline view. Tracking of CPU utilization can still be done aggregated across all cores.
- Lowered log level for system metric collection logs from
warn
todebug
. These logs were very spammy, and were not actionable by end users.
- The MSRV for
memfaultd
has been bumped from 1.65.0 to 1.72.0. If you are using the includedmeta-rust-bin
layer you won't need to change anything. Otherwise, you may need to update your Rust toolchain to a version that is at least 1.72.0.
1.13.0 - 2024-07-15
This is a minor release with a few bug fixes and couple of new metrics related
features. Notably, we are continuing to add support for more metrics collection
within memfaultd
itself. This will remove the dependency on collectd in a
future release. We encourage users to start switching to
built-in metrics collection
if all their metrics need are met with CPU, Memory, Disk, Network, Process
Monitoring and StatsD. Expect built-in metrics collection to become the default
later this year.
memfaultd
will now automatically collect thermal zone metrics when system metric collection is enabled.memfaultd
will now automatically collect network interface metrics when system metric collection is enabled. Themetrics.system_metric_collection.network_interfaces
configuration option can be used to specify which interfaces should be monitored. If it is unset, all non-loopback interfaces will be monitored.memfaultd
will now automatically collect disk usage metrics when system metric collection is enabled. Themetrics.system_metric_collection.disk_space
configuration option can be used to specify which disk should be monitored. If it is unset, all disks whose ID starts with/dev/
will be monitored.memfaultd
can now collect metrics that track the resource utilization of individual processes on the system. By default, it will track itself. Themetrics.system_metric_collection.processes
configuration option can be used to specify which process namesmemfaultd
should monitor.memfaultd
's built-in StatsD server can now ingest Timer readings.
- Daily heartbeat metric reports are now disabled by default.
- The local version of
meta-rust-bin
is updated with the Rust versions up to 1.79.0. memfaultd
now has a separate repo to host its source code! This is a part of an ongoing effort to move thememfaultd
source code outside ofmeta-memfault
. The end goal is to simply have a pointer to thememfaultd
repository in thememfaultd
bitbake recipe, but for now the structure of this repository is unchanged. Check it out at for yourself here!
- A set of logs that were often emitted when
memfault-core-handler
captured a coredump have had their log level lowered fromwarn
todebug
.
- Some work-in-progress files for a new program called
memfault-watch
have been added. Keep an eye out for it to be released soon!
1.12.0 - 2024-05-28
memfaultd
will now try to detect automatically Software Version (/etc/os-release
), Software Type (xxx
), Hardware Version (xxx
), and Device ID (xxx
). These new defaults continue to be overridden bymemfaultd.conf
andmemfaultd-device-info
.- String metric values can now be passed to
memfaultctl start-session
andmemfault end-session
. memfaultd
now has a built-in StatsD server that can be enabled via themetrics.statsd_server
configuration. This allows StatsD clients to write custom metrics directly tomemfaultd
without needing to send them tocollectd
to eventually be flushed. This system is turned off by default in this release.memfaultd
can now capture some system metrics itself withoutcollectd
running on the system. This can be turned on viametrics.system_metric_collection.enable
. Currentlycpu
andmemory
metrics can be collected and further system metrics will be added in the next release. This system is turned off by default in this release.- Logs can now be directly collectd by
memfaultd
withoutfluent-bit
running. This can be configured via thelogs.source
option. This release adds thejournald
log source option which will pull logs from the systemd's journal logging system (on systems that are not using systemd, this will not collect any logs). - Added a
Cross.toml
tometa-memfault/recipes-memfault/memfaultd/files/
so thatmemfaultd
can be cross-compiled withcross
with the necessary system dependencies automatically pulled into the build.
- The
default
set of features in the top-levelCargo.toml
.swupdate
is removed. The goal with this set of features is to provide a target set of features that will compile a standalonememfaultd
application for most systemd-based systems.
- A bug in which metric reports with a duration shorter than 60 seconds had the possibility of reporting 0 crashes when one or more did in fact occur during the duration of the report.
1.11.0 - 2024-04-10
This release builds upon the session-based metric reporting first shipped in 1.10.0 as well as adds an exciting new feature: the capture of system logs in coredumps. This means in addition to everything the Memfault Linux SDK currently captures in a coredump when a process on your device crashes, a configurable number of system log lines will also be captured in the coredump and visible on the Memfault web app! Additionally, we've added a new type of metric report in addition to session reports and the periodic heartbeat - a daily heartbeat report. This is intended to allow devices that are not able to report a full set of periodic heartbeats to Memfault to provide a once-a-day report that aggregates metrics over a 24 hour window.
- Log storage persistence configuration - via the new
logs.storage
config field users can select whether logs processed bymemfautld
should be written to disk or not. If they are not written, creating metrics based on log patterns and including logs in coredumps will still function but log files will not be uploaded to Memfault. The intention behind this option is to limit the number of unnecessary disk writes for systems that are not sending full log files to Memfault. - Daily heartbeats - a once-a-day metric report that aggregates 24 hours of
metrics captured from your device. It can be enabled with the new
metrics.enable_daily_heartbeat
configuration. - The
memfaultctl start-session/end-session
command has been updated to optionally accept Gauge metric readings that will be added to the resulting metric report. Readings are accepted in the format<metric key>=<number>
, similar to thememfaultctl write-attributes
command. Example:memfaultctl end-session camera_recording recording_failed=0
. - Logs from around the time of the crash are now recorded in coredumps and
displayed in the Memfault web app. This can be configured via
coredumps.log_lines
, which has a default of 100 (meaning the 100 most recent log lines will be recorded in a coredump) memfaultd
will now dump all ongoing sessions when it shuts down.- Adds an internal circlular queue implementation (used by logs in coredumps feature).
- Memfault Core Metrics
are always captured in a session metric report, regardless of the
captured_metrics
configuration for that session type. - The internal names of the
MetricReading
variants have been updated make it more clear how each type is aggregated. - Some internal refactoring to centralize the constants for the keys of Memfault Core Metrics.
- Session names that conflict with the reserved names of "heartbeat" and "daily-heartbeat" are now rejected.
- Starting a session of a type that is already in progress is now a no-op. Previously, this reset the metric report for that session type as if the prior one had never been started.
- Updated the path in
memfault-core-handler
for/proc/<pid>/maps
to be an absolute path instead of a relative path to keep behavior consistent regardless of where thememfault-core-handler
binary is executed from. - Some flaky
memfaultd
tests have been updated to consistently pass. - Battery readings with an SOC % less than 0 or greater than 100 are now rejected.
- Empty log files are no longer written to disk nor are they subsequently uploaded to Memfault.
- A bug that caused some coredumps from 32-bit machines to be captured incorrectly while in thread capture mode.
- Bump version for external dependency
reqwest
to 0.11.26 to address a vulnerability in earlier versions.
1.10.0 - 2024-02-15
This release introduces support for session-based metric reporting in the Memfault Linux SDK! While full support in the Memfault web application is coming soon, starting with this release users can begin experimenting with how to best make use of sessions for their devices. Sessions allow for the capture of metrics aggregated over dynamic spans of time in addition to the periodic heartbeat. If you are interested in trying out this feature while it is brewing, please contact us for more details!
- Support for the
sessions
memfaultd.conf
field. This configuration allows users to define sessions and specify which metrics should be captured for each type of session. - New
memfaultctl
commandsstart-session
andend-session
to start and end session metric reports respectively. Whenend-session
is called for an ongoing session, a MAR file with the aggregated metrics for that session is dumped to disk and will be uploaded to Memfault at the next upload interval. - The
"linux-metric-report"
(formerlylinux-heartbeat
) MAR type now contains a"report_type"
field indicating what type of metric report it is. Currently the two types produced bymemfaultd
areheartbeat
andsession
(with the latter including a field for the name of the session)
- The MAR type
"linux-heartbeat"
has been renamed"linux-metric-report"
. memfaultd
can now capture multiple metric reports at once. This enables the capture of a session-based metric report without interrupting the capture of the periodic heartbeat report.- Updated the whitespace formatting in the default
memfaultd.service
file for more broad drop-in compatibility acrosssystemd
versions. - Reboot reasons are now uploaded to the Memfault backend regardless of the fleet sampling resolution for the device.
- The preferred version for dependencies of the
memfaultd
bitbake recipe is now set with the?=
operator to avoid conflicts with other recipes in users' Yocto build that also use these dependencies. libmemfaultc
is now compiled with the -fPIC flag- The error message that is printed when
memfaultctl
can't find a PID file formemfaultd
has been modified to make the source of the error more clear. - The log level for the log output when the last reboot reason file cannot be
found has been downgraded to
debug!
.
- A bug in which MAR files generated from a pre-1.9.0 version of
memfaultd
could not be parsed and uploaded by versions 1.9.0 and 1.9.1.
1.9.1 - 2024-1-5
This is a small patch release to fix a bug we discovered in
memfault-core-handler
- A bug in
memfault-core-handler
that caused a small number of coredumps in which PT_PHDR is not the first entry in the program header table to be captured incorrectly when inthreads
mode.
- Add recording of
info!
,warn!
anderror!
logs from during coredump capture to a note in the core ELF to allow for better visibility into coredump capture errors.
1.9.0 - 2023-12-14
We are excited to introduce support for the
Memfault Core Metrics
in memfaultd
with this release. These metrics enable the monitoring of
connectivity, battery usage, and crashiness out-of-the-box with minimal
configuration.
This release also adds the ability to convert logs into metrics on the edge.
memfaultd
now supports built-in capture of connectivity, battery, and crashiness metrics. See our docs on Memfault Core Metrics.- Crashiness is measured automatically (any hour without a coredump collected will count as a crash-free hour).
- Battery and connectivity are supported via the
battery_monitor
andconnectivity_monitor
configuration options, and the newmemfaultctl
commandsadd-battery-reading
,report-sync-success
, andreport-sync-failure
.
- The ability to convert log into metrics. Further details on this new feature are provided in our logging guide
- Support for string reboot reasons in addition to the existing reset code integers. This allows users to define custom reboot reasons specific to their device or domain. (Visualizations of these custom reboot reason in the backend will be shipped in the next few days).
- The behavior of how Gauge collectd metrics are aggregated on edge. Prior to this change, the last reading collected for a gauge metric was what was sent to Memfault in the interval's heartbeat. Now, the value sent up for a gauge is the average of all readings for that metric within a given heartbeat.
- openssl version bumped to 10.60.0
- Added the
recommended settings
DeleteGauges
andDeleteCounters
totrue
inmeta-memfault-example
'scollectd.conf
. This is so metrics whose readings are sparsely collected do not use resources being uploaded in every upload interval. These can be set tofalse
if alternate behavior is desired (collectd docs here)
- A bug related to how
memfaultd
collects coredumps that caused a very small number of stacktraces to not be displayed properly in the Memfault app. - Added
openssl
tometa-memfault-example
's Fluent Bit bitbake recipe. - Added
zlib
as a dependency ofmemfaultd
in its bitbake recipe.zlib
is required as we use thezlib
backend for theflate2
crate. - Fixed a bug which caused
memfaultd
to create an empty logfile on shutdown.
1.8.1 - 2023-11-6
This is a small release to address a bug we discovered in how MAR entries' estimated size was being calculated.
This bug can cause MAR entries to be deleted before upload when you are using
the logging feature with significant logging activity. We recommend all
customers using the logging
feature to update to this version.
- Fix the logic for estimating the number of inodes that a MAR entry will take on disk.
1.8.0 - 2023-10-25
In this release, we have worked on improving coredump captures and now support different capture modes which will greatly reduce the size of coredumps and make them more useful (see our Coredump Guide).
- The coredump capture strategy can now be set to capture the thread stacks of
the crashed program or all the memory regions included in the ELF core file
generated by the kernel This behavior is defined by
coredump.capture_strategy
inmemfaultd.conf
. memfault-core-handler
now captures the program arguments in the coredump metadata. They will be displayed in the Memfault Dashboard.- A device's software type and version can now be set via
memfault-device-info
withMEMFAULT_SOFTWARE_TYPE
andMEMFAULT_SOFTWARE_VERSION
respectively. - The device's software type and version are now validated in
memfaultd
. Invalid values will trigger an immediate error. - The max age of an unuploaded MAR entry can now be configured in
memfaultd.conf
viamar.mar_entry_max_age_seconds
. Once a MAR entry's age is greater than this value it will be deleted from disk regardless of if it has been uploaded or not. The default is set to 7 days.
- The default coredump capture strategy is now 32 KB threads. This means that
capturing the top 32 kilobytes of every thread instead of capturing all memory
selected in the kernel configuration. The previous default behavior matched
setting
coredump.capture_strategy.type
tokernel_selection
.
- A bug in
meta-rust-bin
that would cause build errors when the target architecture is the same as the builder - Bumped the base Docker image to use
ubuntu:jammy
instead of the deprecatedubuntu:kinetic
. - A bug causing logs of any level to be written to the kernel logs from the
memfault-core-handler
- A number of unused dependencies in
memfaultd
's Bitbake recipe (vim-native, cmake-native, zlib).
1.7.0 - 2023-09-27
This new release will mostly interest users of Memfault for Linux on systems that are not directly connected to the Internet and need another way to transport the data from the device to the cloud.
memfaultd
now supports exporting its data in multiple formats via it's built-in HTTP server. The easiest way to use this new feature is viamemfaultctl export
. Supported formats are MAR (Zip file), Chunk (MAR file encapsulated in a memfault chunk), Chunk-Wrapped (a chunk with an additional header containing a signature and the chunk length).
- We have rewritten our core handler in Rust. In this release, the core handler is iso-functional to the previous release. New features coming soon!
- When building with OpenSSL,
memfaultd
will not try to build rustls-tls. This fixes an issue wherememfaultd
would not build on some systems (mips) because a dependency ofrustls-tls
(ring
) does not build onmips
.
1.6.0 - 2023-09-06
We dropped, or made optional, a number of dependencies. Memfault for Linux will be easier to integrate and run on a wider variety of configuration.
memfaultd
does not requiresystemd
anymore. On Yocto, thesystemd
feature will be activated automatically if your distribution includes systemd. Whensystemd
is not used,memfaultd
will not be able to detect "user triggered" shutdown or reboot. You should use thelast_reboot_reason
file API to notifymemfaultd
before doing a normal shutdown.memfaultd
will now default to using a Rust TLS library in place of OpenSSL. This adds about 800kB to thememfaultd
binary. If you do have OpenSSL on your system and prefer to use it, you can set theopenssl-tls
option (in yourPACKAGECONFIG
for thememfaultd
recipe) to continue using OpenSSL.memfaultd
now supports anupload_interval
set to 0. Whenupload_interval
is 0,memfaultd
will never try to upload data on its own. Data will be written to disk and deleted when the size or inode limits are exceeded. You can still callmemfaultctl sync
(or send the SIGUSR1 signal) to force an immediate upload.
- Cleaned up some of the upload code to remove some dead paths and gain a bit of code size.
- Fluent Bit changed their output format in version 2.1. This version of
memfaultd
supports both the old and new format.
1.5.0 - 2023-07-18
This release introduces fleet-sampling to the Linux SDK. It
also uniformizes all data transfers to use the MAR (Memfault Archive) format,
including heartbeats messages which were previously uploaded directly by
collectd to the cloud. memfaultd
now exposes a local http endpoint to receive
metrics and will upload them as MAR files.
Finally it mostly wraps up our transition to Rust. C is only used to parse
coredumps (the last piece we will eventually rewrite) and in a few rare places
where using Rust is not practical or not possible (calling libsystemd
, using
libconfig
, and triggering a segfault).
This release includes a number of changes that will require changes in your integration.
Specifically:
- collectd now uses a static configuration file (it's not generated by
memfaultd
at run-time). You will need to change yourcollectd.conf
file. See the metrics guide for more details. - Build-time and run-time configuration references to
plugins
have all been dropped. Refer to the release notes below for more details.
memfaultctl enable-dev-mode
now prints a link to the Memfault dashboard where the user can remove server-side limits for this device.- Memfault SDK for Linux now supports fleet sampling: you can configure on the server which devices are sending coredumps, logs and metrics. Read our Fleet Sampling documentation for more information.
memfaultd
now exposes a HTTP server to receive data fromcollectd
. Metrics can be sent at any frequency andmemfaultd
will aggregate them and upload them on the heartbeat interval (by default, every 60 minutes). Metrics are stored and uploaded using MAR entries.- The configuration variable
heartbeat_interval_seconds
controls the aggregation frequency of metrics. - All commands now support the
--verbose
/-V
flag to show extra debug information.
-
The configuration value
refresh_interval_seconds
has been renamed toupload_interval_seconds
to avoid confusion with the newheartbeat_interval_seconds
. -
The names of the
memfaultd
configuration options have been changed to drop theplugin_
prefix. If you are manually setting thePACKAGECONFIG
variable for thememfaultd
recipe, you will need to update it.For example, to enable all features (the default):
-PACKAGECONFIG := "plugin_swupdate plugin_collectd plugin_coredump plugin_logging" +PACKAGECONFIG := "swupdate collectd coredump logging"
We are moving away from the terms "plugins" to "features" to describe the different components of the Memfault SDK for Linux. This better represents how they are built and enabled.
Read the integration-guide for more information.
-
The names of some configuration options have been changed and you will need to edit your configuration file if you changed any of them:
coredump_plugin
is nowcoredump
,swupdate_plugin
is nowswupdate
,reboot_plugin
is nowreboot
.-"swupdate_plugin": { +"swupdate": { "input_file": "/etc/swupdate.cfg", "output_file": "/tmp/swupdate.cfg" }, -"reboot_plugin": { +"reboot": { "last_reboot_reason_file": "/media/last_reboot_reason" }, -"coredump_plugin": { +"coredump": { "coredump_max_size_kib": 96000, "compression": "gzip", "rate_limit_count": 5, "rate_limit_duration_seconds": 3600 },
-
Coredumps are now stored and uploaded using MAR files.
-
Reboots are now stored and uploaded using MAR files.
-
Memfault SDK for Linux is now written mostly in Rust. The use of C code is now minimal.
-
Our recommended configuration for collectd (
collectd_%.bbappend
) is now inmeta-memfault
. It was previously inmeta-memfault-example
.
memfaultd
will not write thecollectd
configuration file automatically on startup. We recommend using a static configuration file instead.- We have removed external dependency on
libjsonc
andlibuuid
with the rewrite in Rust.
- Fixed build issues for some combination of feature flags.
- Bug where
memfaultd
would show a warning about systemd being in an unexpected state when restarting. - Bug where we would show warnings about invalid
memfault-device-info
output multiple times.
1.4.0 - 2023-04-25
This release includes a number of changes that will require changes in your project:
- Edit your
bblayers.conf
to stop usingmeta-rust-bin
layer from the rust-embedded GitHub account and use the version provided in thememfault-linux-sdk
repository. - Edit
memfault.conf
to replacedata_dir
bypersist_dir
and carefully reviewtmp_dir
(which defaults topersist_dir
) and associated options to control maximum usage and minimum headroom. You will most likely need to set your own values. - If you were calling
memfaultd --enable-data-collection
before, you need to replace it bymemfaultctl enable-data-collection
now.
- Memfaultd will now consider the amount of disk space and inodes remaining on
disk when writing logs, storing coredumps and when cleaning the MAR staging
area. See new options
tmp_dir_min_headroom_kib
,tmp_dir_min_inodes
andtmp_dir_max_usage_kib
in the configuration file. - Logging is now rate limited on device (defaults to 500 lines per minute - see
max_lines_per_minute
). - We simplified the configuration options relative to data storage. Users are
now expected to set a
persist_dir
option that must be persisted across reboots and atmp_dir
option that can be cleared on reboot (a temp filesystem in RAM). Refer to Memfault Integration Guide - Storage for more details. - Option
logs.compression_level
to set the logs compression level.
- Memfault Linux SDK now ships with a version of
meta-rust-bin
using a renamed Yocto classcargo_bin
. This was required due tometa-rust-bin
being incompatible with some poky packages. We will track the upstream bug and switch back to upstreammeta-rust-bin
when possible (see meta-rust-bin#135). memfaultd
does not include the commandsenable-dev-mode
andenable-data-collection
anymore (they were deprecated in 1.2.0.)- We now consider logging to be ready for production use and have turned on
plugin_logging
by default. - Some CMake improvements to build with older versions of GCC.
- Rewrote more
memfaultctl
commands to rust:trigger-coredump
,show-settings
,sync
,write-attributes
,enable-dev-mode
andenable-data-collection
.
- Configuration options
logs.tmp_folder
,mar.storage_max_usage_kib
,coredump.storage_max_usage_kib
andcoredump.storage_min_headroom_kib
have been removed and are replaced by the new options listed above. memfaultd --enable-data-collection
and--enable-dev-mode
(as well as--disable...
) have been removed.
- Bug causing coredump-handler to not capture coredumps in development mode.
- Bug causing coredump-handler to create a ratelimiter in the wrong place and fail the capture when it did not have permission to create the file.
- Fluent-bit connector will drop all logs when data collection is not enabled.
- Fluent-bit recommended configuration now includes a
Retry_Limit
. - Wait until memfaultd is ready to write PID file.
- Fixed occasional error message
error sending on closed channel
on shutdown. - Fix bug where
memfaultd
andmemfaultctl
would not properly report their version number. - Show immediate error to the user when
memfaultctl write-attributes
is called but data collection is disabled. - Fix build error when logging was disabled.
1.3.2 - 2023-04-06
- The Yocto layer meta-memfault does not depend on swupdate, collectd and fluent-bit anymore. Instead these dependencies are added by the memfaultd recipe and only when the corresponding plugins are enabled.
- Fix Yocto recipe to always enable network access during compilation and add
openssl
as a dependency. - Updated architecture diagram to include fluent-bit
1.3.1 - 2023-03-22
- Add configuration in
meta-memfault-example
to run on Raspberry Pi 2/3/4.
- Log files are now stored compressed on disk to reduce disk usage.
- To upload Memfault MAR entries (including logs), they are now streamed directly from disk without writing the MAR zip file to disk. This reduces disk I/O (flash wear) and means logs are only written once to disk which is optimal.
- Display server error text for Memfault API endpoints. This helps debug configuration issues.
- Validate the provided
device_id
and show an error if it will not be accepted by Memfault. - Removed memfaultd dependency on libuboot. It was used to detect OTA reboots
but we are now configuring swupdate to call
memfaultctl reboot --reason 3
after installing an upgrade.
- Fixed consistency of logfiles' Cid/NextCid which will help the Memfault dashboard identify discontinuity in the series of logs.
- Fixed the sleep duration displayed after a network error (memfaultd would announce sleeping for an hour but it would actually retry sooner).
- Fix a configuration problem where
collectd
logs would not be visible in the Memfault Dashboard (logs sent only to syslog are not captured by the default configuration - we are now configuringcollectd
to log to the standard output which is captured byjournald
).
1.3.0 - 2023-03-06
- Memfault SDK on Linux now supports Memfault archives (MAR), also used in our Android SDK. Going forward this is how all data will be stored on disk.
- A local TCP endpoint, compatible with fluent-bit tcp output plugin, is now available to capture logs. Logs are written to disk in MAR (Memfault ARchive) format and uploaded to Memfault when the device is online. This feature is in technical preview stage and is disabled by default. See logging on linux for more information.
meta-memfault-example
now includes fluent-bit to demonstrate how to collect logs.- Memfault Linux SDK is now partially written in Rust. Our Yocto layer requires
cargo and rust 1.65.0. We recommend meta-rust-bin from the rust-embedded
project.
- 🚧
memfaultd
in the Linux SDK is currently a mix of C code and Rust. Please excuse the noise while we continue construction. 🚧
- 🚧
- Memfault agent can now be built on Linux and macOS systems (
cargo build
).
memfaultd
can now capture coredumps of itself.
- Fix bug where we restarted swupdate instead of swupdate.service. This removes a warning in the logs.
- Added link to the changelog in the release notes.
- Fix a bug where memfault would ignore SIGUSR1 signal while it was processing uploads.
- Fix a bug in the coredump capturing code that would cause a crash in case more than 16 warnings got emitted during the capture process. Thanks to @attilaszia for reporting this issue.
1.2.0 - 2022-12-26
- [memfaultctl] Added a new command
memfaultctl
to interact withmemfaultd
.memfaultctl trigger-coredump
to force a coredump generation and upload.memfaultctl request-metrics
to forcecollectd
to flush metrics to Memfault.memfaultctl reboot
to save a reboot reason and restart the system.memfaultctl sync
to processmemfaultd
queue immediately.memfaultctl write-attributes
to push device attributes to Memfault.- 'Developer Mode` to reduce rate limits applied to coredumps during development.
- Our Docker container now runs on Apple silicon without Rosetta emulation.
- Updated the
memfault-cli
package in the Docker image. - Added "preferred versions" for
swupdate
andcollectd
. - Coredumps are now compressed with gzip reducing storage and network usage.
memfaultd
is now built with-g3
.
memfaultd --(enable|disable)-dev-collection
andmemfaultctl -s
are now replaced by equivalent commands onmemfaultctl
and will be removed in a future version.
swupdate
would get in a bad state after reloadingmemfaultd
. This is fixed by restarting bothswupdate
andswupdate.socket
units.
1.1.0 - 2022-11-10
- [memfaultd] A new
last_reboot_reason_file
API has been added to enable extending the reboot reason determination subsystem. More information can be found in the documentation of this feature. - [memfaultd]
memfaultd
will now take care of cleaning up/sys/fs/pstore
after a reboot of the system (but only if the reboot reason tracking plugin,plugin_reboot
, is enabled). Often, systemd-pstore.service is configured to carry out this task. This would conflict withmemfaultd
performing this task. Therefore, systemd-pstore.service is automatically excluded when including themeta-memfault
layer. Note thatmemfaultd
does not provide functionality (yet) to archive pstore files (like systemd-pstore.service can). If this is necessary for you, the work-around is to create a service that performs the archiving and runs beforememfaultd.service
starts up.
- [memfaultd] When
memfaultd
would fail to determine the reason for a reboot, it would assume that "low power" was reason for the reboot. This makes little sense because there are many resets for whichmemfaultd
is not able to determine a reason. This fallback is now changed to use "unspecified" in case the reason could not be determined (either from the built-in detection or externally, via the newlast_reboot_reason_file
API). Read the newlast_reboot_reason_file
API for more information. - Various improvements to the QEMU example integration:
- It can now also be built for
qemuarm
(previously, onlyqemuarm64
was working). - Linux pstore/ramoops subsystems are now correctly configured for the QEMU example integration, making it possible to test out the tracking of kernel panic reboot reasons using the QEMU device.
- It can now also be built for
- [memfaultd] The unit test set up is now run on
x86_64
as well asi386
to get coverage on a 64-bit architecture as well as a 32-bit one.
- [memfaultd] Building the SDK on 32-bit systems would fail due to compilation errors. These are now fixed.
- collectd In the example, the statsd plugin would be listening on all network interfaces. This is narrowed to only listen on localhost (127.0.0.1).
- [memfaultd] Many improvements to reboot reason tracking:
- Intermittently, a reboot would erroneously be attributed to "low power".
- Kernel panics would show up in the application as "brown out reset".
- Sometimes, multiple reboot events for a single Linux reboot would get
emitted. The root causes have been found and fixed. Logic has been added
that tracks the Linux
boot_id
to ensure that at most one reboot reason gets emitted per Linux boot. - When using the example integration, the reboot reason "firmware update"
would not be detected after SWUpdate had installed an OTA update. This was
caused by a mismatch of the
defconfig
file in the example integration and the version of SWUpdate that was being compiled. This is now corrected.
- [memfaultd] Fixed a bug in queue.c where an out-of-memory situation could lead to the queue's mutex not getting released.
- Improved the reliability of some of the E2E test scripts.
- When
memfaultd --enable-data-collection
is run and data collection had not yet been enabled, it will regenerate the SWUpdate configuration and restart theswupdate.service
. This restart can cause SWUpdate to get into a bad state and fail to install OTA updates. This is not a new issue and was already present in previous releases. We are investigating this issue. As a work-around, the device can be rebooted immediately after runningmemfaultd --enable-data-collection
. - The systemd-pstore.service gets disabled when including
meta-memfault
, even ifplugin_reboot
is disabled. As a work-around, if you need to keep systemd-pstore.service, remove thesystemd_%.bbappend
file from the SDK.
1.0.0 - 2022-09-28
- This release is the first one including support for collecting and uploading user-land coredumps to the Memfault platform. The coredump plugin is enabled by default. Alongside this SDK release, an accompanying Memfault CLI version 0.11.0 aids in uploading symbol files to Memfault from Yocto builds to facilitate making use of the new functionality. Uploading symbols is a necessary step in order to use Memfault for coredumps. Read more about coredump support in the Memfault Linux SDK.
- Breaking changes in the format of
/etc/memfaultd.conf
(see the updated reference):- The
collectd
top-level key was merged into thecollectd_plugin
top-level key. The fields previously incollectd
that have been moved tocollectd_plugin
are:interval_seconds
non_memfaultd_chain
write_http_buffer_size_kib
- The
collectd_plugin.output_file
key has been replaced by two new keys:collectd_plugin.header_include_output_file
: the value of which should be included as the first statement in your/etc/collectd.conf
file, andcollectd_plugin.footer_include_output_file
: to be included as the last statement of your/etc/collectd.conf
file.
- The
- A misconfiguration bug whereby setting
collectd.interval_seconds
(nowcollectd_plugin.interval_seconds
, see the "Changed" section of this release) would have no effect if our include file was at the bottom of/etc/collectd.conf
. It happened due to the fact that collectdInterval
statements are evaluated as they appear in source code (see the author's statement), only affecting the plugin statements that come after it.
The server-side issue mentioned below has been resolved in the meantime.
Temporarily, our backend processing pipeline is unable to process coredumps
that link to shared objects in a specific style. This affects, in particular,
coredumps coming from devices on the Dunfell release of Yocto.
A backend fix has already been identified and should be released in the next
few business days. Once released, any previously collected coredumps that are
affected will be reprocessed server-side to address this issue. This will
not require any action from your team.
0.3.1 - 2022-09-05
- Support for Yocto version 3.1 (code name "Dunfell"). See the
dunfell
branch of the repository.
- The SDK repository no longer has a
main
branch. The variant of the SDK that supports Yocto 4.0 ("Kirkstone") can be found on the branch namedkirkstone
. Likewise, the variant of the SDK that supports Yocto 3.1 ("Dunfell) can be found on the branch calleddunfell
.
0.3.0 - 2022-08-31
- Initial support for collecting metrics using collectd. Check out the docs on Metrics for Linux for more information.
0.2.0 - 2022-08-10
This is our first public release. Head over to our Linux documentation for an introduction to the Memfault Linux SDK.
- [memfaultd] Now implements exponential back-off for uploads. Requests originating from this exponential back-off system do not interfere with the regular upload interval.
- [memfaultd] Sets persisted flag to disable data collection and returns
immediately:
memfaultd --disable-data-collection
. - [memfaultd] The
builtin.json
configuration file now features a link to documentation for reference. - Improved the top-level
README.md
with a feature and architecture overview.
- [memfaultd] The
--enable-data-collection
flag was not working reliably. - [memfaultd] A parsing bug going through the output of
memfault-device-info
.
During start-up of the memfaultd
service, you may see a log line in the output
of journalctl --unit memfaultd
:
memfaultd.service: Can't open PID file /run/memfaultd.pid (yet?) after start: Operation not permitted
This file is only used by systemd
during service shut-down and its absence
during start-up does not affect the functioning of the daemon. A fix is planned
for a future release. See this report on the Ubuntu nginx
package for a discussion on the topic.
0.1.0 - 2022-07-27
- [memfaultd] Support reporting reboot reasons.
- [memfaultd] Support OTA updates via SWUpdate.
- A memfaultd layer for Yocto (meta-memfault).
- An example Yocto image using memfaultd and the features above (meta-memfault-example).