浏览代码

iommu/iova: Optimise rbtree searching

Checking the IOVA bounds separately before deciding which direction to
continue the search (if necessary) results in redundantly comparing both
pfns twice each. GCC can already determine that the final comparison op
is redundant and optimise it down to 3 in total, but we can go one
further with a little tweak of the ordering (which makes the intent of
the code that much cleaner as a bonus).

Signed-off-by: Zhen Lei <thunder.leizhen@huawei.com>
Tested-by: Ard Biesheuvel <ard.biesheuvel@linaro.org>
Tested-by: Zhen Lei <thunder.leizhen@huawei.com>
Tested-by: Nate Watterson <nwatters@codeaurora.org>
[rm: rewrote commit message to clarify]
Signed-off-by: Robin Murphy <robin.murphy@arm.com>
Signed-off-by: Joerg Roedel <jroedel@suse.de>
Zhen Lei 7 年之前
父节点
当前提交
2070f940a6
共有 1 个文件被更改,包括 3 次插入6 次删除
  1. 3 6
      drivers/iommu/iova.c

+ 3 - 6
drivers/iommu/iova.c

@@ -342,15 +342,12 @@ private_find_iova(struct iova_domain *iovad, unsigned long pfn)
 	while (node) {
 	while (node) {
 		struct iova *iova = rb_entry(node, struct iova, node);
 		struct iova *iova = rb_entry(node, struct iova, node);
 
 
-		/* If pfn falls within iova's range, return iova */
-		if ((pfn >= iova->pfn_lo) && (pfn <= iova->pfn_hi)) {
-			return iova;
-		}
-
 		if (pfn < iova->pfn_lo)
 		if (pfn < iova->pfn_lo)
 			node = node->rb_left;
 			node = node->rb_left;
-		else if (pfn > iova->pfn_lo)
+		else if (pfn > iova->pfn_hi)
 			node = node->rb_right;
 			node = node->rb_right;
+		else
+			return iova;	/* pfn falls within iova's range */
 	}
 	}
 
 
 	return NULL;
 	return NULL;