Commit Graph

760 Commits

Author SHA1 Message Date
JanLJL
314feb4104 black formatting 2021-10-04 14:33:28 +02:00
JanLJL
217fcff664 black conformity 2021-09-30 15:53:56 +02:00
JanLJL
f1f119f5a0 added instructions for BHIVE 2021-09-29 17:26:44 +02:00
JanLJL
3404d72dc5 enhanced parser 2021-09-29 17:26:27 +02:00
Jan
b2e914db4a added lint configs 2021-08-27 08:14:50 +02:00
JanLJL
1b40c10a1f applied flake8 and black rules 2021-08-26 16:58:19 +02:00
JanLJL
db30a4e36c fixed wrong uops info import with masking of some gather/scatter 2021-08-26 11:05:33 +02:00
JanLJL
44f3c0376e version bump 2021-07-21 02:41:05 +02:00
JanLJL
a721d0941d added more load instrs 2021-07-21 02:34:31 +02:00
JanLJL
2d17a48604 updated a few DB entries 2021-06-02 16:37:18 +02:00
JanLJL
98de6f2a92 version bump 2021-06-01 00:13:38 +02:00
JanLJL
ec771dbe91 fixed parsing of reg ranges and lists 2021-06-01 00:10:05 +02:00
JanLJL
8cc36e6691 version bump 2021-05-10 12:56:35 +02:00
JanLJL
5038ce7a15 changed immediate type from str to int 2021-05-10 01:12:30 +02:00
JanLJL
261ad22ccb version bump 2021-05-05 11:16:43 +02:00
JanLJL
8e09927178 added tests for timeout in LCD analyis 2021-05-02 22:48:22 +02:00
JanLJL
e988251c49 fixed bug in case of no uarch in CLI 2021-05-02 22:39:07 +02:00
JanLJL
30d6f4d737 fix #73 2021-05-02 22:22:30 +02:00
Julian Hammer
5ac87b3475 added missing testfile for sve instructions 2021-05-02 21:44:17 +02:00
Julian Hammer
31e35d8815 addd LD2 and ST2 instructions to a64fx 2021-04-23 13:33:32 +02:00
Julian Hammer
c4163dd930 Merge branch 'master' of github.com:RRZE-HPC/OSACA 2021-04-23 13:18:23 +02:00
Julian Hammer
781b8b6b89 improved register range and list support on AArch64 2021-04-23 13:12:18 +02:00
JanLJL
129052a84a fixed incompatibilty to py3.6 2021-04-20 13:59:56 +02:00
JanLJL
c1234e2d45 set min requirement to py3.6 2021-04-20 13:59:32 +02:00
JanLJL
08b29e93a0 added wheel to pypi publishing 2021-04-19 11:33:29 +02:00
JanLJL
5ef7aa3466 fixed duplicate hyperlink tags 2021-04-19 10:58:11 +02:00
JanLJL
730132881d added no timeout option 2021-04-19 10:57:51 +02:00
JanLJL
e93672d5ff version bump 2021-04-19 10:14:26 +02:00
JanLJL
af947c54f5 Merge branch 'master' of https://github.com/RRZE-HPC/OSACA 2021-04-19 00:34:32 +02:00
JanLJL
b2af6a61c6 bugfixed CLX as uarch flag 2021-04-19 00:34:21 +02:00
Jan
e139833eab added lcd-timeout flag, citations and updated credits 2021-04-19 00:27:24 +02:00
JanLJL
6208536863 added CLX as synonym for CSX uarch 2021-04-19 00:05:53 +02:00
JanLJL
5b95f1f909 enhanced LCD analysis by making it parallel and added timeout flag 2021-04-19 00:04:03 +02:00
JanLJL
3d580960b6 keep dependency paths as generators instead of lists 2021-04-17 12:46:44 +02:00
JanLJL
e3b08a2574 allow different base with prefix for offset values 2021-04-17 11:06:39 +02:00
Julian Hammer
f05c7fa637 version bump 2021-04-15 14:47:09 +02:00
Julian Hammer
c54685ee2c readme added to validation folder 2021-04-15 14:45:23 +02:00
Julian
04836cf3f9 Validation (#71)
Validating of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.

build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4, Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to next OSACA release.

For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz

The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb

Quite a few changes on OSACA included:

Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representaion
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
5990cdc2b4 added missing instructions to all DBs 2021-04-05 16:47:52 +02:00
JanLJL
7a0120205c changed consideration of masking for database back to NO 2021-03-25 11:50:17 +01:00
JanLJL
23b5b4f4d3 added new instructions and bugfixed the wrong import from uops.info for masks for A FEW instructions 2021-03-25 11:47:15 +01:00
Julian Hammer
48db31b430 flake8 to ignore some errors and small style improvements 2021-03-11 12:52:34 +01:00
Julian Hammer
0f8e53e918 making flake8 happy 2021-03-11 12:29:14 +01:00
Julian Hammer
1147ea78bf fixed output redirection 2021-03-11 12:17:46 +01:00
Julian Hammer
0be5632814 added blame ignore file 2021-03-11 12:03:40 +01:00
Julian Hammer
afa607e546 migrate code style to Black 2021-03-11 12:02:45 +01:00
Julian Hammer
f284ae84ca sanity check validity of operand entries 2021-03-11 11:38:25 +01:00
Julian Hammer
d786acad34 fixed typos and added missing default load/store info 2021-03-11 10:10:09 +01:00
JanLJL
f8af078d3d guarantee 0 latency for None values in DB 2021-03-11 01:55:57 +01:00
Julian Hammer
f7777954ad fix #66 typo in icl.yml
cmp failed with immediates
2021-03-10 11:57:21 +01:00