[PATCH v3 00/11] memcg: per cgroup dirty page accounting

From: Greg Thelen
Date: Monday, October 18, 2010 - 5:39 pm

Changes since v2:
- Rather than disabling softirq in lock_page_cgroup(), introduce a separate lock
  to synchronize between memcg page accounting and migration.  This only affects
  patch 4 of the series: it used to disable softirq, and now introduces the new
  lock.

Changes since v1:
- Renamed "nfs"/"total_nfs" to "nfs_unstable"/"total_nfs_unstable" in per cgroup
  memory.stat to match /proc/meminfo.
- Avoid lockdep warnings by using rcu_read_[un]lock() in
  mem_cgroup_has_dirty_limit().
- Fixed lockdep issue in mem_cgroup_read_stat() which is exposed by these
  patches.
- Remove redundant comments.
- Rename (for clarity):
  - mem_cgroup_write_page_stat_item -> mem_cgroup_page_stat_item
  - mem_cgroup_read_page_stat_item -> mem_cgroup_nr_pages_item
- Renamed newly created proc files:
  - memory.dirty_bytes -> memory.dirty_limit_in_bytes
  - memory.dirty_background_bytes -> memory.dirty_background_limit_in_bytes
- Removed unnecessary get_ prefix from get_xxx() functions.
- Allow [kKmMgG] suffixes for newly created dirty limit value cgroupfs files.
- Disable softirq rather than hardirq in lock_page_cgroup()
- Made mem_cgroup_move_account_page_stat() inline.
- Ported patches to mmotm-2010-10-13-17-13.

This patch set provides the ability for each cgroup to have independent dirty
page limits.

Limiting dirty memory fixes the maximum amount of dirty (hard to reclaim) page
cache that a cgroup can use.  With multiple cgroup writers, no cgroup can
consume more than its designated share of dirty pages; a cgroup that crosses
its limit is forced to perform write-out.

The patches are based on a series proposed by Andrea Righi in Mar 2010.


Overview:
- Add page_cgroup flags to record when pages are dirty, in writeback, or nfs
  unstable.

- Extend mem_cgroup to record the total number of pages in each of the 
  interesting dirty states (dirty, writeback, unstable_nfs).  

- Add dirty parameters similar to the system-wide /proc/sys/vm/dirty_*
  limits to mem_cgroup.  The mem_cgroup dirty parameters are accessible
  via cgroupfs control files.

- Consider both system and per-memcg dirty limits in page writeback when
  deciding to queue background writeback or block for foreground writeback.


Known shortcomings:
- When a cgroup dirty limit is exceeded, bdi writeback is used to write back
  dirty inodes.  Bdi writeback considers inodes from any cgroup, not just
  inodes contributing dirty pages to the cgroup exceeding its limit.


Performance data:
- A page fault microbenchmark, which can be run in read or write mode, was used
  to measure performance:
        f = open(foo.$cpu)
        truncate(f, 4096)
        alarm(60)
        while (1) {
                p = mmap(f, 4096)
                if (write)
                        *p = 1
                else
                        x = *p
                munmap(p)
        }

- The workload was run at several points in the patch series, in the following
  modes:
  - s_read is a single-threaded reader
  - s_write is a single-threaded writer
  - p_read is a 16-thread reader, each thread operating on a different file
  - p_write is a 16-thread writer, each thread operating on a different file

- Measurements were collected on a 16-core non-NUMA system using "perf stat
  --repeat 3".  The -a option was used for parallel (p_*) runs.

- All numbers are page fault rate (M/sec).  Higher is better.

- To measure the impact of these patches on a kernel without memcg, compare the
  first and last rows; neither has memcg configured.  The first row does not
  include any of these memcg patches.

- To measure the impact of memcg dirty limits, compare the baseline (2nd row,
  titled "w/ memcg") with the patched code with memcg enabled (2nd to last row,
  titled "all patches").

                                 root_cgroup                     child_cgroup
                       s_read s_write p_read p_write    s_read s_write p_read p_write
mmotm w/o memcg         0.424  0.400   0.420  0.395
w/ memcg                0.419  0.390   0.395  0.371      0.413  0.385   0.385  0.361
all patches             0.421  0.384   0.395  0.362      0.418  0.380   0.396  0.360
all patches w/o memcg   0.425  0.396   0.423  0.388


Balbir Singh (1):
  memcg: CPU hotplug lockdep warning fix

Greg Thelen (9):
  memcg: add page_cgroup flags for dirty page tracking
  memcg: document cgroup dirty memory interfaces
  memcg: create extensible page stat update routines
  memcg: add dirty page accounting infrastructure
  memcg: add kernel calls for memcg dirty page stats
  memcg: add dirty limits to mem_cgroup
  memcg: add cgroupfs interface to memcg dirty limits
  writeback: make determine_dirtyable_memory() static.
  memcg: check memcg dirty limits in page writeback

KAMEZAWA Hiroyuki (1):
  memcg: add lock to synchronize page accounting and migration

 Documentation/cgroups/memory.txt |   60 ++++++
 fs/nfs/write.c                   |    4 +
 include/linux/memcontrol.h       |   78 +++++++-
 include/linux/page_cgroup.h      |   54 +++++-
 include/linux/writeback.h        |    2 -
 mm/filemap.c                     |    1 +
 mm/memcontrol.c                  |  417 ++++++++++++++++++++++++++++++++++++--
 mm/page-writeback.c              |  213 +++++++++++++-------
 mm/rmap.c                        |    4 +-
 mm/truncate.c                    |    1 +
 10 files changed, 726 insertions(+), 108 deletions(-)
--