JanLJL
314feb4104
black formatting
2021-10-04 14:33:28 +02:00
JanLJL
217fcff664
black conformity
2021-09-30 15:53:56 +02:00
JanLJL
f1f119f5a0
added instructions for BHIVE
2021-09-29 17:26:44 +02:00
JanLJL
3404d72dc5
enhanced parser
2021-09-29 17:26:27 +02:00
Jan
b2e914db4a
added lint configs
2021-08-27 08:14:50 +02:00
JanLJL
1b40c10a1f
applied flake8 and black rules
2021-08-26 16:58:19 +02:00
JanLJL
db30a4e36c
fixed wrong uops info import with masking of some gather/scatter
2021-08-26 11:05:33 +02:00
JanLJL
44f3c0376e
version bump
2021-07-21 02:41:05 +02:00
JanLJL
a721d0941d
added more load instrs
2021-07-21 02:34:31 +02:00
JanLJL
2d17a48604
updated a few DB entries
2021-06-02 16:37:18 +02:00
JanLJL
98de6f2a92
version bump
2021-06-01 00:13:38 +02:00
JanLJL
ec771dbe91
fixed parsing of reg ranges and lists
2021-06-01 00:10:05 +02:00
JanLJL
8cc36e6691
version bump
2021-05-10 12:56:35 +02:00
JanLJL
5038ce7a15
changed immediate type from str to int
2021-05-10 01:12:30 +02:00
JanLJL
261ad22ccb
version bump
2021-05-05 11:16:43 +02:00
JanLJL
8e09927178
added tests for timeout in LCD analyis
2021-05-02 22:48:22 +02:00
JanLJL
e988251c49
fixed bug in case of no uarch in CLI
2021-05-02 22:39:07 +02:00
JanLJL
30d6f4d737
fix #73
2021-05-02 22:22:30 +02:00
Julian Hammer
5ac87b3475
added missing testfile for sve instructions
2021-05-02 21:44:17 +02:00
Julian Hammer
31e35d8815
addd LD2 and ST2 instructions to a64fx
2021-04-23 13:33:32 +02:00
Julian Hammer
c4163dd930
Merge branch 'master' of github.com:RRZE-HPC/OSACA
2021-04-23 13:18:23 +02:00
Julian Hammer
781b8b6b89
improved register range and list support on AArch64
2021-04-23 13:12:18 +02:00
JanLJL
129052a84a
fixed incompatibilty to py3.6
2021-04-20 13:59:56 +02:00
JanLJL
c1234e2d45
set min requirement to py3.6
2021-04-20 13:59:32 +02:00
JanLJL
08b29e93a0
added wheel to pypi publishing
2021-04-19 11:33:29 +02:00
JanLJL
5ef7aa3466
fixed duplicate hyperlink tags
2021-04-19 10:58:11 +02:00
JanLJL
730132881d
added no timeout option
2021-04-19 10:57:51 +02:00
JanLJL
e93672d5ff
version bump
2021-04-19 10:14:26 +02:00
JanLJL
af947c54f5
Merge branch 'master' of https://github.com/RRZE-HPC/OSACA
2021-04-19 00:34:32 +02:00
JanLJL
b2af6a61c6
bugfixed CLX as uarch flag
2021-04-19 00:34:21 +02:00
Jan
e139833eab
added lcd-timeout flag, citations and updated credits
2021-04-19 00:27:24 +02:00
JanLJL
6208536863
added CLX as synonym for CSX uarch
2021-04-19 00:05:53 +02:00
JanLJL
5b95f1f909
enhanced LCD analysis by making it parallel and added timeout flag
2021-04-19 00:04:03 +02:00
JanLJL
3d580960b6
keep dependency paths as generators instead of lists
2021-04-17 12:46:44 +02:00
JanLJL
e3b08a2574
allow different base with prefix for offset values
2021-04-17 11:06:39 +02:00
Julian Hammer
f05c7fa637
version bump
2021-04-15 14:47:09 +02:00
Julian Hammer
c54685ee2c
readme added to validation folder
2021-04-15 14:45:23 +02:00
Julian
04836cf3f9
Validation ( #71 )
...
Validating of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.
build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4, Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to next OSACA release.
For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz
The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb
Quite a few changes on OSACA included:
Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representaion
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
5990cdc2b4
added missing instructions to all DBs
2021-04-05 16:47:52 +02:00
JanLJL
7a0120205c
changed consideration of masking for database back to NO
2021-03-25 11:50:17 +01:00
JanLJL
23b5b4f4d3
added new instructions and bugfixed the wrong import from uops.info for masks for A FEW instructions
2021-03-25 11:47:15 +01:00
Julian Hammer
48db31b430
flake8 to ignore some errors and small style improvements
2021-03-11 12:52:34 +01:00
Julian Hammer
0f8e53e918
making flake8 happy
2021-03-11 12:29:14 +01:00
Julian Hammer
1147ea78bf
fixed output redirection
2021-03-11 12:17:46 +01:00
Julian Hammer
0be5632814
added blame ignore file
2021-03-11 12:03:40 +01:00
Julian Hammer
afa607e546
migrate code style to Black
2021-03-11 12:02:45 +01:00
Julian Hammer
f284ae84ca
sanity check validity of operand entries
2021-03-11 11:38:25 +01:00
Julian Hammer
d786acad34
fixed typos and added missing default load/store info
2021-03-11 10:10:09 +01:00
JanLJL
f8af078d3d
guarantee 0 latency for None values in DB
2021-03-11 01:55:57 +01:00
Julian Hammer
f7777954ad
fix #66 typo in icl.yml
...
cmp failed with immediates
2021-03-10 11:57:21 +01:00