JanLJL
5eb3e07ad6
Merge branch 'master' of https://github.com/RRZE-HPC/OSACA
2021-04-19 00:34:32 +02:00
JanLJL
a82a0e24a3
bugfixed CLX as uarch flag
2021-04-19 00:34:21 +02:00
Jan
6db08c7e8e
added lcd-timeout flag, citations and updated credits
2021-04-19 00:27:24 +02:00
JanLJL
e6a54ee131
added CLX as synonym for CSX uarch
2021-04-19 00:05:53 +02:00
JanLJL
152360bad2
enhanced LCD analysis by making it parallel and added timeout flag
2021-04-19 00:04:03 +02:00
JanLJL
607d459569
keep dependency paths as generators instead of lists
2021-04-17 12:46:44 +02:00
JanLJL
b033b3b7aa
allow different base with prefix for offset values
2021-04-17 11:06:39 +02:00
Julian Hammer
0c295dc847
version bump
v0.4.0
2021-04-15 14:47:09 +02:00
Julian Hammer
5588e41492
readme added to validation folder
2021-04-15 14:45:23 +02:00
Julian
08440ed5e1
Validation ( #71 )
...
Validating of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.
build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4, Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to next OSACA release.
For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz
The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb
Quite a few changes on OSACA included:
Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representaion
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
25a0e0607d
added missing instructions to all DBs
2021-04-05 16:47:52 +02:00
JanLJL
b0e35316f0
changed consideration of masking for database back to NO
2021-03-25 11:50:17 +01:00
JanLJL
94313ec772
added new instructions and bugfixed the wrong import from uops.info for masks for A FEW instructions
2021-03-25 11:47:15 +01:00
Julian Hammer
63563ecabc
flake8 to ignore some errors and small style improvements
2021-03-11 12:52:34 +01:00
Julian Hammer
b7625a4a25
making flake8 happy
2021-03-11 12:29:14 +01:00
Julian Hammer
7da2f5bd7b
fixed output redirection
2021-03-11 12:17:46 +01:00
Julian Hammer
6158a83b4f
added blame ignore file
2021-03-11 12:03:40 +01:00
Julian Hammer
6204c90934
migrate code style to Black
2021-03-11 12:02:45 +01:00
Julian Hammer
1ebe5ecfbd
sanity check validity of operand entries
2021-03-11 11:38:25 +01:00
Julian Hammer
0c5ac26f3f
fixed typos and added missing default load/store info
2021-03-11 10:10:09 +01:00
JanLJL
9a13e5cbc5
guarantee 0 latency for None values in DB
2021-03-11 01:55:57 +01:00
Julian Hammer
dcf3e38612
fix #66 typo in icl.yml
...
cmp failed with immediates
2021-03-10 11:57:21 +01:00
Jan
09a14465c1
Merge pull request #65 from RRZE-HPC/throw_parsing_error
...
passing parsing errors to the outside
2021-03-08 09:38:24 +01:00
Julian Hammer
d7a687909e
passing parsing errors to the outside
2021-03-05 18:07:36 +01:00
JanLJL
f8d53a69d7
changed test after adjustment in parser
2021-02-25 08:12:10 +01:00
JanLJL
74a479fb95
fixed AArch64 parser for register shifts and new instructions for A64FX
2021-02-25 07:43:42 +01:00
Julian Hammer
4fdf312622
add "sub gpr, gpr" to skx
2021-02-15 11:41:21 +01:00
Julian Hammer
803da767f2
add "vmovapd zmm, zmm" to skx
2021-02-15 11:41:20 +01:00
JanLJL
0e69db9de9
new instructions
2021-01-31 18:11:06 +01:00
JanLJL
9f87606ce8
minor model fixes
2021-01-26 12:56:19 +01:00
JanLJL
768a90de10
cover kernel with only unknown instructions
2021-01-07 12:40:16 +01:00
Jan
8c9557760e
Merge pull request #62 from jdomke/attfix
...
att parser: workaround for crash with "jg,pt" mnemonic
For now we will ignore the branch taken/not-taken indication and will only keep the condition in the mnemonic.
2021-01-07 10:23:39 +01:00
Jan
4dbcfbda5d
suppress branch indication in mnemonic for now
2021-01-07 10:20:57 +01:00
JanLJL
ed13cde61b
fixed DV port bug
2021-01-07 09:59:18 +01:00
jdomke
4f8ed13309
att parser: support for more mnemonic
...
found some 'jg,pt' in icc/mkl generated binaries which crashed the
parser, here an example:
dd8ccd: 3e 7f 90 jg,pt dd8c60 <mkl_blas_avx2_dtrsm_kernel...
2021-01-05 15:26:22 +09:00
JanLJL
3c7971b347
fixes #61
2020-12-28 10:33:54 +01:00
Jan
d89a742718
Changed badge target to GH Actions
2020-12-18 14:13:26 +01:00
Julian Hammer
05fdbcf060
updated travis badge to gh action
2020-12-18 14:13:26 +01:00
Julian Hammer
0f86d2d1b2
disabled py35 with tox
2020-12-18 14:13:26 +01:00
Julian Hammer
dff86d456e
configured flake8 max-line-length
2020-12-18 14:13:26 +01:00
Julian Hammer
77e7c3a520
thrid wheel was to much
2020-12-18 14:13:26 +01:00
Julian Hammer
ba957877e3
merged test and publish again
2020-12-18 14:13:26 +01:00
Julian Hammer
1d52362306
added missing runs-on
2020-12-18 14:13:26 +01:00
Julian Hammer
dd37a21c56
installing libgraphviz-dev
2020-12-18 14:13:26 +01:00
Julian Hammer
0bdc180a52
installing graphviz
2020-12-18 14:13:26 +01:00
Julian Hammer
f6a02a8f3e
enabling gh_actions
2020-12-18 14:13:26 +01:00
JanLJL
60f792c4b2
new instructions
2020-12-17 12:38:58 +01:00
JanLJL
449932d75b
version bump
v0.3.14
2020-12-11 01:00:30 +01:00
Jan
c68ad48e6b
Update README.rst
2020-12-11 00:58:34 +01:00
JanLJL
8e3d613843
new instructions
2020-12-09 11:52:10 +01:00