JanLJL
97756faa04
Merge branch 'master' into pr-armcc
2023-03-14 17:50:48 +01:00
JanLJL
0985e81b23
added more dependency analysis for post/pre indexing and condition flags
2023-03-14 17:00:02 +01:00
JanLJL
23139d4d5b
bugfix, resolved #90
2023-03-07 17:05:31 +01:00
JanLJL
72f69fa707
added fallback search in arch/ISA model for ARM instructions with shape/cc suffixes
2023-03-03 15:11:40 +01:00
JanLJL
c1373fe44c
enabled indexing without shape and lane for vector regs
2023-03-03 14:41:48 +01:00
JanLJL
3d7e91386b
new black formatting
2023-02-15 18:54:53 +01:00
JanLJL
2a1ee9423f
new black formatting
2023-02-15 18:46:20 +01:00
JanLJL
ead1ffc5ca
black formatting
2023-02-15 16:53:26 +01:00
JanLJL
d5aaceb9c4
added Zen3 support
2022-09-27 18:39:14 +02:00
JanLJL
1d847f4510
black formatting
2022-06-22 17:12:53 +02:00
JanLJL
e4df8893dd
fixed #88
2022-06-22 17:09:24 +02:00
JanLJL
1b40c10a1f
applied flake8 and black rules
2021-08-26 16:58:19 +02:00
JanLJL
ec771dbe91
fixed parsing of reg ranges and lists
2021-06-01 00:10:05 +02:00
JanLJL
5038ce7a15
changed immediate type from str to int
2021-05-10 01:12:30 +02:00
JanLJL
8e09927178
added tests for timeout in LCD analyis
2021-05-02 22:48:22 +02:00
Julian Hammer
5ac87b3475
added missing testfile for sve instructions
2021-05-02 21:44:17 +02:00
Julian Hammer
c4163dd930
Merge branch 'master' of github.com:RRZE-HPC/OSACA
2021-04-23 13:18:23 +02:00
Julian Hammer
781b8b6b89
improved register range and list support on AArch64
2021-04-23 13:12:18 +02:00
Julian
04836cf3f9
Validation ( #71 )
...
Validating of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.
build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4, Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to next OSACA release.
For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz
The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb
Quite a few changes on OSACA included:
Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representaion
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
0f8e53e918
making flake8 happy
2021-03-11 12:29:14 +01:00
Julian Hammer
1147ea78bf
fixed output redirection
2021-03-11 12:17:46 +01:00
Julian Hammer
afa607e546
migrate code style to Black
2021-03-11 12:02:45 +01:00
Julian Hammer
f284ae84ca
sanity check validity of operand entries
2021-03-11 11:38:25 +01:00
JanLJL
89aa82f9c8
changed test after adjustment in parser
2021-02-25 08:12:10 +01:00
JanLJL
40afe51454
added the possibility of a 5th operand
2020-12-06 18:05:59 +01:00
JanLJL
36ca6a064f
fixed typo
2020-11-11 14:11:00 +01:00
JanLJL
4a9c5ff789
fixed test after changing TP value of instruction
2020-11-11 14:04:07 +01:00
Julian Hammer
7f8b596fc9
improved performance of arch_semantics and reg dependency matching
2020-11-09 19:27:47 +01:00
JanLJL
9b29c77b5d
enhanced length warning
2020-11-06 15:49:13 +01:00
JanLJL
58b82699a6
added missing test file
2020-11-06 15:07:57 +01:00
JanLJL
1ba46da7c9
minor bugfix in HW model and added user warnings for more insight
2020-11-06 15:06:36 +01:00
JanLJL
00156f4d86
added --lines option
2020-11-06 12:57:41 +01:00
JanLJL
60d6107cba
Merge branch 'master' of https://github.com/RRZE-HPC/OSACA
2020-10-29 13:00:09 +01:00
JanLJL
bfd07a83a4
autodetect ISA and default uarch for ISA
2020-10-29 13:00:02 +01:00
JanLJL
2f6f57bdce
fixed bug in tests and removed unused imports
2020-10-28 19:29:48 +01:00
Julian Hammer
e29cfb3185
new caching structure with support for distribution
2020-10-28 16:29:55 +01:00
JanLJL
2df4b353ed
no \t replacement before any other point than user output
2020-10-16 09:44:18 +02:00
JanLJL
633e93ee4b
Merge branch 'master' into A64FX
2020-10-15 22:44:12 +02:00
JanLJL
fb0ba144a5
adjusted tests for AArch64
2020-10-15 17:56:08 +02:00
Julian Hammer
f798014b51
extended and cleaned up marker tests
2020-10-15 13:54:18 +02:00
Julian Hammer
e351aea7e3
removed a nother set of no-maker tests
2020-10-13 09:03:13 +02:00
Julian Hammer
11a51f30ca
removed maker missing tests
2020-10-12 19:34:04 +02:00
JanLJL
ab8ffdf171
added tests
2020-04-02 08:57:26 +02:00
JanLJL
2e2187a180
added more tests
2020-03-26 10:19:14 +01:00
JanLJL
49592f77fd
enhanced x86 parser for directives
2020-03-26 10:02:39 +01:00
JanLJL
ac409d4a7f
added test to check all micro-archs
2020-03-16 21:31:14 +01:00
JanLJL
b2e1e6f930
implemented online check for src/dst of operands based on felixcloutier
2020-03-12 15:08:14 +01:00
JanLJL
d54739812a
minor fixes
2020-02-27 18:11:32 +01:00
JanLJL
dbde8c958e
reduced testing runtime
2020-02-27 16:23:56 +01:00
JanLJL
a204762fa3
enabled kerncraft marker insertion for aarch64 and more tests
2020-02-27 16:00:23 +01:00