wazero

Author	SHA1	Message	Date
Edoardo Vacchi	ea63499005	compiler(arm64): avoid conditional jumps to the trap handler (#1524 ) Signed-off-by: Edoardo Vacchi <evacchi@users.noreply.github.com>	2023-06-19 19:18:18 +10:00
Edoardo Vacchi	f385873239	compiler(arm64): generate trap exit code once (#1516 ) Signed-off-by: Edoardo Vacchi <evacchi@users.noreply.github.com>	2023-06-15 08:35:51 +10:00
Anuraag Agrawal	62f8109101	compiler: fix compileMemoryAccessOffsetSetup comment (#1504 ) Signed-off-by: Anuraag Agrawal <anuraaga@gmail.com>	2023-06-05 16:16:23 +10:00
Achille	9780f0f4a0	compiler: zero-copy code assembly (#1481 ) Signed-off-by: Achille Roussel <achille.roussel@gmail.com> Co-authored-by: Crypt Keeper <64215+codefromthecrypt@users.noreply.github.com>	2023-05-19 07:06:30 +02:00
Takeshi Yoneda	867459d7d5	compiler: mmap per module instead of per function (#1377 ) This changes the mmap strategy used in the compiler backend. Previously, we used mmap syscall once per function and allocated the executable pages each time. Basically, mmap can only allocate the boundary of the page size of the underlying os. Even if the requested executable is smaller than the page size, the entire page is marked as executable and won't be reused by Go runtime. Therefore, we wasted roughly `(len(body)%osPageSize)*function`. Even though we still need to align each function on 16 bytes boundary when mmaping per module, the wasted space is much smaller than before. The following benchmark results shows that this improves the overall compilation performance while showing the heap usage increased. However, the increased heap usage is totally offset by the hidden wasted memory page which is not measured by Go's -benchmem. Actually, when I did the experiments, I observed that roughly 20~30mb are wasted on arm64 previously which is larger than the increased heap usage in this result. More importantly, this increased heap usage is a target of GC and should be ignorable in the long-running program vs the wasted page is persistent until the CompiledModule is closed. Not only the actual compilation time, the result indicates that this could improve the overall Go runtime's performance maybe thanks to not abusing runtime.Finalizer since you can see this improves the subsequent interpreter benchmark results. ``` goos: darwin goarch: arm64 pkg: github.com/tetratelabs/wazero/internal/integration_test/bench │ old.txt │ new.txt │ │ sec/op │ sec/op vs base │ Compilation_sqlite3/compiler-10 183.4m ± 0% 175.9m ± 2% -4.10% (p=0.001 n=7) Compilation_sqlite3/interpreter-10 61.59m ± 0% 59.57m ± 0% -3.29% (p=0.001 n=7) geomean 106.3m 102.4m -3.69% │ old.txt │ new.txt │ │ B/op │ B/op vs base │ Compilation_sqlite3/compiler-10 42.93Mi ± 0% 54.33Mi ± 0% +26.56% (p=0.001 n=7) Compilation_sqlite3/interpreter-10 51.75Mi ± 0% 51.75Mi ± 0% -0.01% (p=0.001 n=7) geomean 47.13Mi 53.02Mi +12.49% │ old.txt │ new.txt │ │ allocs/op │ allocs/op vs base │ Compilation_sqlite3/compiler-10 26.07k ± 0% 26.06k ± 0% ~ (p=0.149 n=7) Compilation_sqlite3/interpreter-10 13.90k ± 0% 13.90k ± 0% ~ (p=0.421 n=7) geomean 19.03k 19.03k -0.02% goos: linux goarch: amd64 pkg: github.com/tetratelabs/wazero/internal/integration_test/bench cpu: AMD Ryzen 9 3950X 16-Core Processor │ old.txt │ new.txt │ │ sec/op │ sec/op vs base │ Compilation_sqlite3/compiler-32 384.4m ± 2% 373.0m ± 4% -2.97% (p=0.001 n=7) Compilation_sqlite3/interpreter-32 86.09m ± 4% 65.05m ± 2% -24.44% (p=0.001 n=7) geomean 181.9m 155.8m -14.38% │ old.txt │ new.txt │ │ B/op │ B/op vs base │ Compilation_sqlite3/compiler-32 49.40Mi ± 0% 59.91Mi ± 0% +21.29% (p=0.001 n=7) Compilation_sqlite3/interpreter-32 51.77Mi ± 0% 51.76Mi ± 0% -0.02% (p=0.001 n=7) geomean 50.57Mi 55.69Mi +10.12% │ old.txt │ new.txt │ │ allocs/op │ allocs/op vs base │ Compilation_sqlite3/compiler-32 28.70k ± 0% 28.70k ± 0% ~ (p=0.925 n=7) Compilation_sqlite3/interpreter-32 14.00k ± 0% 14.00k ± 0% -0.04% (p=0.010 n=7) geomean 20.05k 20.04k -0.02% ``` resolves #1060 Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-04-26 14:11:37 +09:00
Takeshi Yoneda	d9e5d6b0d3	Avoids unnecessary allocations during mmap executables (#1366 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-04-17 14:03:35 +09:00
Takeshi Yoneda	b6d19696da	compiler: reuses allocated runtimeValueLocation stacks (#1348 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-04-10 14:58:47 +09:00
Takeshi Yoneda	f167939c88	wazeroir: avoids allocation with InclusiveRange (#1345 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-04-07 12:14:39 +09:00
Takeshi Yoneda	24dbf49c79	compiler: avoid alloc with stack pointer ceil and reuse bytes reader (#1344 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-04-06 16:47:08 +09:00
Takeshi Yoneda	cc28399052	wazeroir: reuses allocated slices for a module (#1342 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-04-05 20:26:44 +09:00
Edoardo Vacchi	0dc152d672	wazeroir: migrate vector, table, branch and all other remaining ops to compact repr (#1334 ) Signed-off-by: Edoardo Vacchi <evacchi@users.noreply.github.com> Co-authored-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-04-05 09:38:49 +09:00
Edoardo Vacchi	f3ef84c9b3	wazeroir: Load Ops, Store Ops, Set, Pick, Select, CallIndirect (#1329 ) Signed-off-by: Edoardo Vacchi <evacchi@users.noreply.github.com> Co-authored-by: Takeshi Yoneda <t.y.mathetake@gmail.com>	2023-04-01 08:00:27 +09:00
Takeshi Yoneda	b9babda2d4	compiler: stop using map for label infos (#1327 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-03-31 15:14:16 +09:00
Takeshi Yoneda	cee1e50ad2	compiler: avoids allocation in label resolution (#1326 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-03-30 22:26:18 -07:00
Takeshi Yoneda	bbae781978	compiler: avoid allocation in SetJumpTargetOnNext (#1325 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-03-31 13:03:26 +09:00
Takeshi Yoneda	276d7015df	compiler: removes takeFreeRegisters fn (#1324 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-03-31 12:34:41 +09:00
Edoardo Vacchi	8887799da7	wazeroir: move unary byte ops to UnionOperator (#1320 ) Signed-off-by: Edoardo Vacchi <evacchi@users.noreply.github.com>	2023-03-31 08:07:12 +09:00
Edoardo Vacchi	c5d37877bd	wazeroir: migrate unary operations to UnionOperation (#1318 ) * refactor: OperationCall, OperationGlobalGet, OperationGlobalSet * refactor: Constant Operations Signed-off-by: Edoardo Vacchi <evacchi@users.noreply.github.com> ---------	2023-03-30 12:57:45 +02:00
Takeshi Yoneda	0857336746	Removes wasm.FunctionInstance type (#1294 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-03-28 14:43:44 +09:00
Takeshi Yoneda	350e81e632	Holds function types as values, not ptrs in wasm.Module (#1227 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-03-15 13:45:52 +09:00
Takeshi Yoneda	ad968dc3fe	Pass correct api.Module to host functions (#1213 ) Fixes #1211 Previously, host functions are getting api.Module for the "originating" module, which is the module for api.Function currently invoked, except that the api.Module is modified by withMemory with the caller's memory instance, therefore there haven't been no problem for most cases. The only issues were the methods besides Memory() of api.Module, and this commit fixes them. Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-03-08 15:05:48 +09:00
Takeshi Yoneda	5eab1a7307	compiler: pass runtimeValueLocationStack by values to reduce allocations (#1170 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-02-27 09:42:45 +09:00
Takeshi Yoneda	0547a8e0ab	wazeroir: uses u64 as unique keys for labels (#1142 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-02-21 08:24:59 +09:00
Takeshi Yoneda	381f26db20	wazeroir: less allocations with Labels (#1141 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-02-20 17:09:17 +09:00
Takeshi Yoneda	a265d41d30	wazeroir: less allocations (#1138 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-02-20 13:29:56 +09:00
Takeshi Yoneda	92e0a488b6	Support context Cancelation/Timeout in function executions (#1108 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-02-10 09:39:04 +09:00
Crypt Keeper	8918d73020	ci: supports building with Go 1.20 and raises floor version to 1.18 (#1096 ) This moves our floor version to the same we'll release 1.0 with: 1.18. This is congruent with our version policy which is current-2. Fixes #921 Signed-off-by: Adrian Cole <adrian@tetrate.io>	2023-02-06 17:29:08 +02:00
Takeshi Yoneda	6eb1e5d9fb	arm64: fixes runtimeValueType for i32.wrap_i64 (#1011 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2023-01-06 14:28:48 +09:00
Takeshi Yoneda	e7018d19ff	compiler: force moduleContext initialization after Go function calls (#988 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-12-31 14:05:30 +09:00
Crypt Keeper	921df7e7a6	cli: adds -hostlogging=filesystem for diagnosing problems (#966 ) Signed-off-by: Adrian Cole <adrian@tetrate.io>	2022-12-28 11:38:24 +08:00
Takeshi Yoneda	d63c747d53	asm,compiler: reduce allocations during compilation (#936 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-12-20 12:49:47 +09:00
Takeshi Yoneda	0dde445074	compiler: make moduleContext.functions non-pointer slice (#898 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-12-07 15:20:24 +09:00
Takeshi Yoneda	6c4dd1cfd9	Adds support for DWARF based stack traces (#881 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-12-05 14:59:45 +09:00
Takeshi Yoneda	84d733eb0a	compiler: adds support for FunctionListeners (#869 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-11-29 15:15:49 +09:00
Crypt Keeper	329ccca6b1	Switches from gofmt to gofumpt (#848 ) This switches to gofumpt and applies changes, as I've noticed working in dapr (who uses this) that it finds some things that are annoying, such as inconsistent block formatting in test tables. Signed-off-by: Adrian Cole <adrian@tetrate.io>	2022-11-09 05:48:24 +01:00
Crypt Keeper	1cbb496c26	Stops using "ex." to abbreviate "for example" (#827 ) Signed-off-by: Adrian Cole <adrian@tetrate.io>	2022-10-24 11:51:48 +09:00
Takeshi Yoneda	9ad8af121a	compiler: simplify calling convention (#782 ) This simplifies the calling convention and consolidates the call frame stack and value stack into a single stack. As a result, the cost of function calls decreases because we now don't need to check the boundary twice (value and call frame stacks) at each function call. The following is the result of the benchmark for recursive Fibonacci function in integration_test/bench/testdata/case.go, and it shows that this actually improves the performance of function calls. [amd64] name old time/op new time/op delta Invocation/compiler/fib_for_5-32 109ns ± 3% 81ns ± 1% -25.86% (p=0.008 n=5+5) Invocation/compiler/fib_for_10-32 556ns ± 3% 473ns ± 3% -14.99% (p=0.008 n=5+5) Invocation/compiler/fib_for_20-32 61.4µs ± 2% 55.9µs ± 5% -8.98% (p=0.008 n=5+5) Invocation/compiler/fib_for_30-32 7.41ms ± 3% 6.83ms ± 3% -7.90% (p=0.008 n=5+5) [arm64] name old time/op new time/op delta Invocation/compiler/fib_for_5-10 67.7ns ± 1% 60.2ns ± 1% -11.12% (p=0.000 n=9+9) Invocation/compiler/fib_for_10-10 487ns ± 1% 460ns ± 0% -5.56% (p=0.000 n=10+9) Invocation/compiler/fib_for_20-10 58.0µs ± 1% 54.3µs ± 1% -6.38% (p=0.000 n=10+10) Invocation/compiler/fib_for_30-10 7.12ms ± 1% 6.67ms ± 1% -6.31% (p=0.000 n=10+9) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-09-06 13:29:56 +09:00
Takeshi Yoneda	15a774e543	compiler: hold stack base pointers in bytes, not index to the stack slice (#781 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-08-30 13:00:55 +09:00
Takeshi Yoneda	17b8591dcc	Change OperationSwap to OperationSet (#775 ) This commit changes wazeroir.OperationSwap to OperationSet which is the combination of OperationSwap and Drop in the previous implementation. Previously, OperationSwap was always followed by OperationDrop on the swapped value on top. Because of that, we had a redundant register allocation in Swap. As a result, we use only one register in OperationSet which is a part of translations of local.tee and local.set. Signed-off-by: Takeshi Yoneda takeshi@tetrate.io	2022-08-29 15:28:39 +09:00
Takeshi Yoneda	3b32c2028b	Externalize compilation cache by compilers (#747 ) This adds the experimental support of the file system compilation cache. Notably, experimental.WithCompilationCacheDirName allows users to configure where the compiler writes the cache into. Versioning/validation of binary compatibility has been done via the release tag (which will be created from the end of this month). More specifically, the cache file starts with a header with the hardcoded wazero version. Fixes #618 Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io> Co-authored-by: Crypt Keeper <64215+codefromthecrypt@users.noreply.github.com>	2022-08-18 19:37:11 +09:00
Takeshi Yoneda	fd05b9610c	arm64: do not load/store higher bits of 32-bit int/float (#743 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-08-12 16:09:37 +08:00
Takeshi Yoneda	ef19028513	Fixes function calls / memory access with wasm-implemented host functions (#737 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-08-07 09:50:34 +08:00
Takeshi Yoneda	b18b36c173	compiler: support for Wasm-implemented host functinos (#728 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-30 11:33:03 +09:00
Takeshi Yoneda	c5daf5a218	interpreter,compiler(arm64): clears higher bits in i32.load_8/16_s (#725 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-28 15:45:07 +09:00
Takeshi Yoneda	d1336806d6	compiler: save conditional values at data.drop (#724 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-27 17:18:42 +09:00
Takeshi Yoneda	a064f68532	compiler: allow memory access after table.grow (#721 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-27 09:56:48 +09:00
Takeshi Yoneda	d15cc069c6	compiler: save conditional value on elem.drop (#722 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-27 09:53:19 +09:00
Takeshi Yoneda	143b2be398	compiler: save conditional values at ref.func (#717 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-25 13:58:15 +09:00
Takeshi Yoneda	8d75403c49	compiler: save conditional values at table.size (#715 ) Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-25 13:32:32 +09:00
Takeshi Yoneda	ed068597cd	ci: adds Go 1.19.0-rc.2 into matrix (#714 ) This adds the 1.19.0-rc2. in the testing matrix. This also formats the Godocs across the codebase, as Go 1.19 has started auto-formatting Godoc. https://github.com/tetratelabs/wazero/issues/426 Signed-off-by: Takeshi Yoneda <takeshi@tetrate.io>	2022-07-25 11:14:00 +09:00

1 2

71 Commits