Commit Graph

753 Commits

Author SHA1 Message Date
JanLJL
44f3c0376e version bump 2021-07-21 02:41:05 +02:00
JanLJL
a721d0941d added more load instrs 2021-07-21 02:34:31 +02:00
JanLJL
2d17a48604 updated a few DB entries 2021-06-02 16:37:18 +02:00
JanLJL
98de6f2a92 version bump 2021-06-01 00:13:38 +02:00
JanLJL
ec771dbe91 fixed parsing of reg ranges and lists 2021-06-01 00:10:05 +02:00
JanLJL
8cc36e6691 version bump 2021-05-10 12:56:35 +02:00
JanLJL
5038ce7a15 changed immediate type from str to int 2021-05-10 01:12:30 +02:00
JanLJL
261ad22ccb version bump 2021-05-05 11:16:43 +02:00
JanLJL
8e09927178 added tests for timeout in LCD analysis 2021-05-02 22:48:22 +02:00
JanLJL
e988251c49 fixed bug in case of no uarch in CLI 2021-05-02 22:39:07 +02:00
JanLJL
30d6f4d737 fix #73 2021-05-02 22:22:30 +02:00
Julian Hammer
5ac87b3475 added missing testfile for sve instructions 2021-05-02 21:44:17 +02:00
Julian Hammer
31e35d8815 added LD2 and ST2 instructions to a64fx 2021-04-23 13:33:32 +02:00
Julian Hammer
c4163dd930 Merge branch 'master' of github.com:RRZE-HPC/OSACA 2021-04-23 13:18:23 +02:00
Julian Hammer
781b8b6b89 improved register range and list support on AArch64 2021-04-23 13:12:18 +02:00
JanLJL
129052a84a fixed incompatibility to py3.6 2021-04-20 13:59:56 +02:00
JanLJL
c1234e2d45 set min requirement to py3.6 2021-04-20 13:59:32 +02:00
JanLJL
08b29e93a0 added wheel to pypi publishing 2021-04-19 11:33:29 +02:00
JanLJL
5ef7aa3466 fixed duplicate hyperlink tags 2021-04-19 10:58:11 +02:00
JanLJL
730132881d added no timeout option 2021-04-19 10:57:51 +02:00
JanLJL
e93672d5ff version bump 2021-04-19 10:14:26 +02:00
JanLJL
af947c54f5 Merge branch 'master' of https://github.com/RRZE-HPC/OSACA 2021-04-19 00:34:32 +02:00
JanLJL
b2af6a61c6 bugfixed CLX as uarch flag 2021-04-19 00:34:21 +02:00
Jan
e139833eab added lcd-timeout flag, citations and updated credits 2021-04-19 00:27:24 +02:00
JanLJL
6208536863 added CLX as synonym for CSX uarch 2021-04-19 00:05:53 +02:00
JanLJL
5b95f1f909 enhanced LCD analysis by making it parallel and added timeout flag 2021-04-19 00:04:03 +02:00
JanLJL
3d580960b6 keep dependency paths as generators instead of lists 2021-04-17 12:46:44 +02:00
JanLJL
e3b08a2574 allow different base with prefix for offset values 2021-04-17 11:06:39 +02:00
Julian Hammer
f05c7fa637 version bump 2021-04-15 14:47:09 +02:00
Julian Hammer
c54685ee2c readme added to validation folder 2021-04-15 14:45:23 +02:00
Julian
04836cf3f9 Validation (#71)
Validation of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.

build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4; Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to the next OSACA release.

For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz

The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb

Quite a few changes on OSACA included:

Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representation
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
5990cdc2b4 added missing instructions to all DBs 2021-04-05 16:47:52 +02:00
JanLJL
7a0120205c changed consideration of masking for database back to NO 2021-03-25 11:50:17 +01:00
JanLJL
23b5b4f4d3 added new instructions and bugfixed the wrong import from uops.info for masks for A FEW instructions 2021-03-25 11:47:15 +01:00
Julian Hammer
48db31b430 flake8 to ignore some errors and small style improvements 2021-03-11 12:52:34 +01:00
Julian Hammer
0f8e53e918 making flake8 happy 2021-03-11 12:29:14 +01:00
Julian Hammer
1147ea78bf fixed output redirection 2021-03-11 12:17:46 +01:00
Julian Hammer
0be5632814 added blame ignore file 2021-03-11 12:03:40 +01:00
Julian Hammer
afa607e546 migrate code style to Black 2021-03-11 12:02:45 +01:00
Julian Hammer
f284ae84ca sanity check validity of operand entries 2021-03-11 11:38:25 +01:00
Julian Hammer
d786acad34 fixed typos and added missing default load/store info 2021-03-11 10:10:09 +01:00
JanLJL
f8af078d3d guarantee 0 latency for None values in DB 2021-03-11 01:55:57 +01:00
Julian Hammer
f7777954ad fix #66 typo in icl.yml
cmp failed with immediates
2021-03-10 11:57:21 +01:00
Jan
87cde04dbf Merge pull request #65 from RRZE-HPC/throw_parsing_error
passing parsing errors to the outside
2021-03-08 09:38:24 +01:00
Julian Hammer
c788e7b97b passing parsing errors to the outside 2021-03-05 18:07:36 +01:00
JanLJL
89aa82f9c8 changed test after adjustment in parser 2021-02-25 08:12:10 +01:00
JanLJL
70ddce85c9 fixed AArch64 parser for register shifts and new instructions for A64FX 2021-02-25 07:43:42 +01:00
Julian Hammer
fb1754b56d add "sub gpr, gpr" to skx 2021-02-15 11:41:21 +01:00
Julian Hammer
307ad73696 add "vmovapd zmm, zmm" to skx 2021-02-15 11:41:20 +01:00
JanLJL
36566589ae new instructions 2021-01-31 18:11:06 +01:00