瀏覽代碼

perf bench: Fix memcpy benchmark for large sizes

The glibc calloc() function has an optimization to not explicitely
memset() very large calloc allocations that just came from mmap(),
because they are known to be zero.

This could result in the perf memcpy benchmark reading only from
the zero page, which gives unrealistic results.

Always call memset explicitly on the source area to avoid this problem.

Signed-off-by: Andi Kleen <ak@linux.intel.com>
Cc: Hitoshi Mitake <h.mitake@gmail.com>
Cc: Kirill A. Shutemov <kirill.shutemov@linux.intel.com>
Link: http://lkml.kernel.org/n/tip-pzz2qrdq9eymxda0y8yxdn33@git.kernel.org
Signed-off-by: Arnaldo Carvalho de Melo <acme@redhat.com>
Andi Kleen 12 年之前
父節點
當前提交
a198996c7a
共有 1 個文件被更改,包括 2 次插入0 次删除
  1. 2 0
      tools/perf/bench/mem-memcpy.c

+ 2 - 0
tools/perf/bench/mem-memcpy.c

@@ -117,6 +117,8 @@ static void alloc_mem(void **dst, void **src, size_t length)
 	*src = zalloc(length);
 	*src = zalloc(length);
 	if (!*src)
 	if (!*src)
 		die("memory allocation failed - maybe length is too large?\n");
 		die("memory allocation failed - maybe length is too large?\n");
+	/* Make sure to always replace the zero pages even if MMAP_THRESH is crossed */
+	memset(*src, 0, length);
 }
 }
 
 
 static u64 do_memcpy_cycle(memcpy_t fn, size_t len, bool prefault)
 static u64 do_memcpy_cycle(memcpy_t fn, size_t len, bool prefault)