mm: avoid false sharing of mm_counter
Considering the nature of per mm stats, it's the shared object among threads and can be a cache-miss point in the page fault path. This patch adds per-thread cache for mm_counter. RSS value will be counted into a struct in task_struct and synchronized with mm's one at events. Now, in this patch, the event is the number of calls to handle_mm_fault. Per-thread value is added to mm at each 64 calls. rough estimation with small benchmark on parallel thread (2threads) shows [before] 4.5 cache-miss/faults [after] 4.0 cache-miss/faults Anyway, the most contended object is mmap_sem if the number of threads grows. [akpm@linux-foundation.org: coding-style fixes] Signed-off-by:KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com> Cc: Minchan Kim <minchan.kim@gmail.com> Cc: Christoph Lameter <cl@linux-foundation.org> Cc: Lee Schermerhorn <lee.schermerhorn@hp.com> Cc: David Rientjes <rientjes@google.com> Signed-off-by:
Andrew Morton <akpm@linux-foundation.org> Signed-off-by:
Linus Torvalds <torvalds@linux-foundation.org>
Showing
- Documentation/filesystems/proc.txt 6 additions, 0 deletionsDocumentation/filesystems/proc.txt
- fs/exec.c 1 addition, 0 deletionsfs/exec.c
- include/linux/mm.h 3 additions, 5 deletionsinclude/linux/mm.h
- include/linux/mm_types.h 6 additions, 0 deletionsinclude/linux/mm_types.h
- include/linux/sched.h 3 additions, 1 deletioninclude/linux/sched.h
- kernel/exit.c 2 additions, 1 deletionkernel/exit.c
- mm/memory.c 86 additions, 8 deletionsmm/memory.c
Loading
Please register or sign in to comment