Read /proc/net files with a single read syscall. #361

ablagoev · 2017-05-03T13:03:36Z

The /proc/net files are not guaranteed to be consistent; they are only
consistent at the row level. This is probably one of the reasons why
consecutive read calls might return duplicate entries: the kernel is
changing the file as it is being read. In certain situations this can
lead to loop-like behavior, where the same net entry is returned by consecutive read calls as new connections are added to the kernel TCP table.

This PR tries to reduce the duplicates by fetching the contents
of the net files with a single read syscall.

This discussion on Stackoverflow goes into more detail on how the /proc files work in terms of consistency.

In our use case, there were situations where the memory usage of the netstat telegraf input plugin spiked. On further inspection, this appeared to be caused by a large number of entries in the /proc/net/tcp file. These entries, however, did not correspond to a higher number of TCP connections, which remained within the expected range. This leads me to believe that in certain situations loop-like conditions are created and many duplicate rows appear in /proc/net/tcp.

Consider the following graphs:

The memory in the graph is the memory used by the netstat telegraf plugin. You can see how spikes in the /proc/net/tcp length correspond to spikes in netstat's memory usage. During those spikes there are no real spikes in TCP connection counts. The length of /proc/net/tcp was gathered separately using cat /proc/net/tcp. cat itself issues many read calls to the file, so it exhibits the same behavior as the netstat plugin.
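
One way to check whether a captured snapshot of /proc/net/tcp really contains repeated rows is sketched below. The helper `countDuplicateRows` is hypothetical, and the choice of key (local address, remote address and inode, taken from whitespace-separated columns 1, 2 and 9) is an assumption about what identifies a row. The first column is ignored on purpose, on the assumption that it is only a running row index.

```go
package main

import (
	"strings"
)

// countDuplicateRows is a hypothetical diagnostic helper.
// It returns the number of data rows in a /proc/net/tcp style snapshot
// and how many of them repeat a row that was already seen.
func countDuplicateRows(contents string) (total, duplicates int) {
	seen := make(map[string]struct{})
	for i, line := range strings.Split(contents, "\n") {
		fields := strings.Fields(line)
		// Skip the header (first line) and blank or truncated rows.
		if i == 0 || len(fields) < 10 {
			continue
		}
		total++
		// Assumed key: local_address, rem_address and inode.
		key := fields[1] + "|" + fields[2] + "|" + fields[9]
		if _, ok := seen[key]; ok {
			duplicates++
			continue
		}
		seen[key] = struct{}{}
	}
	return total, duplicates
}
```

A large `duplicates` value alongside a stable connection count would be consistent with the behavior described above.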

After the patch the graphs look like this:

There are no more spikes in memory.

Slightly more CPU is consumed, but this is offset by the reduction in duplicates and the stabilization of memory usage. For smaller use cases I don't think CPU usage will be an issue; in larger ones, single reads yield much better performance overall.
In our use case we are observing a 56% improvement in the time required to gather the netstat data:

The /proc/net files are not guaranteed to be consistent; they are only consistent at the row level. This is probably one of the reasons why consecutive read calls might return duplicate entries: the kernel is changing the file as it is being read. In certain situations this can lead to loop-like behavior, where the same net entry is returned repeatedly while reading the file as new connections are added to the kernel TCP table, i.e. there can be a lot of duplicates. This commit tries to reduce the duplicates by fetching the contents of the net files with a single read syscall.

shirou · 2017-05-04T11:39:15Z

So cool! Thank you. Could you add a comment explaining why common.ReadLines is not used,
along with this PR's URL?

shirou · 2017-05-04T12:34:55Z

Thank you for your contribution!

As per shirou#361

zzyongx · 2019-04-10T02:53:59Z

ioutil.ReadFile is still slow on CentOS 6 when /proc/net/tcp is larger than 30M. But it's not Go's problem; using cat is slow too.

Lomanic · 2019-05-28T17:26:19Z

@zzyongx see #695 for how gopsutil could improve performance using netlink API.

Add comments with a short explanation and link to the PR request

b32353f

shirou merged commit f7c38fa into shirou:master May 4, 2017

apoorv007 added a commit to archsaber/gopsutil that referenced this pull request Jun 24, 2017

Read /proc/net files with a single read syscall.

53b6dfd

As per shirou#361

wcc526 mentioned this pull request May 29, 2019

Read /proc/net files with a single read syscall. prometheus/node_exporter#1360

Closed

cforce mentioned this pull request Jul 4, 2024

hostmetrics featuring ebpf - resource efficient scraping open-telemetry/opentelemetry-collector-contrib#32446

Open
