Commit Graph

735 Commits

Author SHA1 Message Date
JanLJL
5ef7aa3466 fixed duplicate hyperlink tags 2021-04-19 10:58:11 +02:00
JanLJL
730132881d added no timeout option 2021-04-19 10:57:51 +02:00
JanLJL
e93672d5ff version bump 2021-04-19 10:14:26 +02:00
JanLJL
af947c54f5 Merge branch 'master' of https://github.com/RRZE-HPC/OSACA 2021-04-19 00:34:32 +02:00
JanLJL
b2af6a61c6 bugfixed CLX as uarch flag 2021-04-19 00:34:21 +02:00
Jan
e139833eab added lcd-timeout flag, citations and updated credits 2021-04-19 00:27:24 +02:00
JanLJL
6208536863 added CLX as synonym for CSX uarch 2021-04-19 00:05:53 +02:00
JanLJL
5b95f1f909 enhanced LCD analysis by making it parallel and added timeout flag 2021-04-19 00:04:03 +02:00
JanLJL
3d580960b6 keep dependency paths as generators instead of lists 2021-04-17 12:46:44 +02:00
JanLJL
e3b08a2574 allow different base with prefix for offset values 2021-04-17 11:06:39 +02:00
Julian Hammer
f05c7fa637 version bump 2021-04-15 14:47:09 +02:00
Julian Hammer
c54685ee2c readme added to validation folder 2021-04-15 14:45:23 +02:00
Julian
04836cf3f9 Validation (#71)
Validating of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.

build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4, Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to next OSACA release.

For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz

The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb

Quite a few changes on OSACA included:

Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representaion
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
5990cdc2b4 added missing instructions to all DBs 2021-04-05 16:47:52 +02:00
JanLJL
7a0120205c changed consideration of masking for database back to NO 2021-03-25 11:50:17 +01:00
JanLJL
23b5b4f4d3 added new instructions and bugfixed the wrong import from uops.info for masks for A FEW instructions 2021-03-25 11:47:15 +01:00
Julian Hammer
48db31b430 flake8 to ignore some errors and small style improvements 2021-03-11 12:52:34 +01:00
Julian Hammer
0f8e53e918 making flake8 happy 2021-03-11 12:29:14 +01:00
Julian Hammer
1147ea78bf fixed output redirection 2021-03-11 12:17:46 +01:00
Julian Hammer
0be5632814 added blame ignore file 2021-03-11 12:03:40 +01:00
Julian Hammer
afa607e546 migrate code style to Black 2021-03-11 12:02:45 +01:00
Julian Hammer
f284ae84ca sanity check validity of operand entries 2021-03-11 11:38:25 +01:00
Julian Hammer
d786acad34 fixed typos and added missing default load/store info 2021-03-11 10:10:09 +01:00
JanLJL
f8af078d3d guarantee 0 latency for None values in DB 2021-03-11 01:55:57 +01:00
Julian Hammer
f7777954ad fix #66 typo in icl.yml
cmp failed with immediates
2021-03-10 11:57:21 +01:00
Jan
87cde04dbf Merge pull request #65 from RRZE-HPC/throw_parsing_error
passing parsing errors to the outside
2021-03-08 09:38:24 +01:00
Julian Hammer
c788e7b97b passing parsing errors to the outside 2021-03-05 18:07:36 +01:00
JanLJL
89aa82f9c8 changed test after adjustment in parser 2021-02-25 08:12:10 +01:00
JanLJL
70ddce85c9 fixed AArch64 parser for register shifts and new instructions for A64FX 2021-02-25 07:43:42 +01:00
Julian Hammer
fb1754b56d add "sub gpr, gpr" to skx 2021-02-15 11:41:21 +01:00
Julian Hammer
307ad73696 add "vmovapd zmm, zmm" to skx 2021-02-15 11:41:20 +01:00
JanLJL
36566589ae new instructions 2021-01-31 18:11:06 +01:00
JanLJL
51e26b90d6 minor model fixes 2021-01-26 12:56:19 +01:00
JanLJL
cbcd868297 cover kernel with only unknown instructions 2021-01-07 12:40:16 +01:00
Jan
68f1602167 Merge pull request #62 from jdomke/attfix
att parser: workaround for crash with "jg,pt" mnemonic
For now we will ignore the branch taken/not-taken indication and will only keep the condition in the mnemonic.
2021-01-07 10:23:39 +01:00
Jan
ffcdee5ed9 suppress branch indication in mnemonic for now 2021-01-07 10:20:57 +01:00
JanLJL
5b85041bd5 fixed DV port bug 2021-01-07 09:59:18 +01:00
jdomke
eb1a8a2674 att parser: support for more mnemonic
found some 'jg,pt' in icc/mkl generated binaries which crashed the
parser, here an example:
 dd8ccd:   3e 7f 90   jg,pt  dd8c60 <mkl_blas_avx2_dtrsm_kernel...
2021-01-05 15:26:22 +09:00
JanLJL
7b83e4a370 fixes #61 2020-12-28 10:33:54 +01:00
Jan
427eebfbfe Changed badge target to GH Actions 2020-12-18 14:13:26 +01:00
Julian Hammer
c63a5f5c22 updated travis badge to gh action 2020-12-18 14:13:26 +01:00
Julian Hammer
d49f7838ee disabled py35 with tox 2020-12-18 14:13:26 +01:00
Julian Hammer
7b575f1d1c configured flake8 max-line-length 2020-12-18 14:13:26 +01:00
Julian Hammer
a6adb4bed8 thrid wheel was to much 2020-12-18 14:13:26 +01:00
Julian Hammer
2a2159e302 merged test and publish again 2020-12-18 14:13:26 +01:00
Julian Hammer
66ea9629ea added missing runs-on 2020-12-18 14:13:26 +01:00
Julian Hammer
e06823a568 installing libgraphviz-dev 2020-12-18 14:13:26 +01:00
Julian Hammer
aa618a221a installing graphviz 2020-12-18 14:13:26 +01:00
Julian Hammer
944c268d0d enabling gh_actions 2020-12-18 14:13:26 +01:00
JanLJL
ea1e6ff464 new instructions 2020-12-17 12:38:58 +01:00