JanLJL
5ef7aa3466
fixed duplicate hyperlink tags
2021-04-19 10:58:11 +02:00
JanLJL
730132881d
added no timeout option
2021-04-19 10:57:51 +02:00
JanLJL
e93672d5ff
version bump
2021-04-19 10:14:26 +02:00
JanLJL
af947c54f5
Merge branch 'master' of https://github.com/RRZE-HPC/OSACA
2021-04-19 00:34:32 +02:00
JanLJL
b2af6a61c6
bugfixed CLX as uarch flag
2021-04-19 00:34:21 +02:00
Jan
e139833eab
added lcd-timeout flag, citations and updated credits
2021-04-19 00:27:24 +02:00
JanLJL
6208536863
added CLX as synonym for CSX uarch
2021-04-19 00:05:53 +02:00
JanLJL
5b95f1f909
enhanced LCD analysis by making it parallel and added timeout flag
2021-04-19 00:04:03 +02:00
JanLJL
3d580960b6
keep dependency paths as generators instead of lists
2021-04-17 12:46:44 +02:00
JanLJL
e3b08a2574
allow different base with prefix for offset values
2021-04-17 11:06:39 +02:00
Julian Hammer
f05c7fa637
version bump
2021-04-15 14:47:09 +02:00
Julian Hammer
c54685ee2c
readme added to validation folder
2021-04-15 14:45:23 +02:00
Julian
04836cf3f9
Validation ( #71 )
...
Validating of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.
build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4, Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to next OSACA release.
For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz
The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb
Quite a few changes on OSACA included:
Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representaion
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
5990cdc2b4
added missing instructions to all DBs
2021-04-05 16:47:52 +02:00
JanLJL
7a0120205c
changed consideration of masking for database back to NO
2021-03-25 11:50:17 +01:00
JanLJL
23b5b4f4d3
added new instructions and bugfixed the wrong import from uops.info for masks for A FEW instructions
2021-03-25 11:47:15 +01:00
Julian Hammer
48db31b430
flake8 to ignore some errors and small style improvements
2021-03-11 12:52:34 +01:00
Julian Hammer
0f8e53e918
making flake8 happy
2021-03-11 12:29:14 +01:00
Julian Hammer
1147ea78bf
fixed output redirection
2021-03-11 12:17:46 +01:00
Julian Hammer
0be5632814
added blame ignore file
2021-03-11 12:03:40 +01:00
Julian Hammer
afa607e546
migrate code style to Black
2021-03-11 12:02:45 +01:00
Julian Hammer
f284ae84ca
sanity check validity of operand entries
2021-03-11 11:38:25 +01:00
Julian Hammer
d786acad34
fixed typos and added missing default load/store info
2021-03-11 10:10:09 +01:00
JanLJL
f8af078d3d
guarantee 0 latency for None values in DB
2021-03-11 01:55:57 +01:00
Julian Hammer
f7777954ad
fix #66 typo in icl.yml
...
cmp failed with immediates
2021-03-10 11:57:21 +01:00
Jan
87cde04dbf
Merge pull request #65 from RRZE-HPC/throw_parsing_error
...
passing parsing errors to the outside
2021-03-08 09:38:24 +01:00
Julian Hammer
c788e7b97b
passing parsing errors to the outside
2021-03-05 18:07:36 +01:00
JanLJL
89aa82f9c8
changed test after adjustment in parser
2021-02-25 08:12:10 +01:00
JanLJL
70ddce85c9
fixed AArch64 parser for register shifts and new instructions for A64FX
2021-02-25 07:43:42 +01:00
Julian Hammer
fb1754b56d
add "sub gpr, gpr" to skx
2021-02-15 11:41:21 +01:00
Julian Hammer
307ad73696
add "vmovapd zmm, zmm" to skx
2021-02-15 11:41:20 +01:00
JanLJL
36566589ae
new instructions
2021-01-31 18:11:06 +01:00
JanLJL
51e26b90d6
minor model fixes
2021-01-26 12:56:19 +01:00
JanLJL
cbcd868297
cover kernel with only unknown instructions
2021-01-07 12:40:16 +01:00
Jan
68f1602167
Merge pull request #62 from jdomke/attfix
...
att parser: workaround for crash with "jg,pt" mnemonic
For now we will ignore the branch taken/not-taken indication and will only keep the condition in the mnemonic.
2021-01-07 10:23:39 +01:00
Jan
ffcdee5ed9
suppress branch indication in mnemonic for now
2021-01-07 10:20:57 +01:00
JanLJL
5b85041bd5
fixed DV port bug
2021-01-07 09:59:18 +01:00
jdomke
eb1a8a2674
att parser: support for more mnemonic
...
found some 'jg,pt' in icc/mkl generated binaries which crashed the
parser, here an example:
dd8ccd: 3e 7f 90 jg,pt dd8c60 <mkl_blas_avx2_dtrsm_kernel...
2021-01-05 15:26:22 +09:00
JanLJL
7b83e4a370
fixes #61
2020-12-28 10:33:54 +01:00
Jan
427eebfbfe
Changed badge target to GH Actions
2020-12-18 14:13:26 +01:00
Julian Hammer
c63a5f5c22
updated travis badge to gh action
2020-12-18 14:13:26 +01:00
Julian Hammer
d49f7838ee
disabled py35 with tox
2020-12-18 14:13:26 +01:00
Julian Hammer
7b575f1d1c
configured flake8 max-line-length
2020-12-18 14:13:26 +01:00
Julian Hammer
a6adb4bed8
thrid wheel was to much
2020-12-18 14:13:26 +01:00
Julian Hammer
2a2159e302
merged test and publish again
2020-12-18 14:13:26 +01:00
Julian Hammer
66ea9629ea
added missing runs-on
2020-12-18 14:13:26 +01:00
Julian Hammer
e06823a568
installing libgraphviz-dev
2020-12-18 14:13:26 +01:00
Julian Hammer
aa618a221a
installing graphviz
2020-12-18 14:13:26 +01:00
Julian Hammer
944c268d0d
enabling gh_actions
2020-12-18 14:13:26 +01:00
JanLJL
ea1e6ff464
new instructions
2020-12-17 12:38:58 +01:00