JanLJL
97756faa04
Merge branch 'master' into pr-armcc
2023-03-14 17:50:48 +01:00
JanLJL
0985e81b23
added more dependency analysis for post/pre indexing and condition flags
2023-03-14 17:00:02 +01:00
JanLJL
c1373fe44c
enabled indexing without shape and lane for vector regs
2023-03-03 14:41:48 +01:00
Décio Luiz Gazzoni Filho
b434e30ec1
Support for flags and conditional ops on AArch64
2023-02-19 22:08:42 -03:00
JanLJL
9bbb289f9d
formatting
2022-04-07 12:17:08 +02:00
Qingcai Jiang
728bb03a93
fix a bug about orr in tsv110
2022-03-20 14:53:34 +08:00
Jan
b18f7bf718
black-conform formatting
2021-12-03 14:38:52 +01:00
Qingcai Jiang
3efda4ba6c
fix a bug when the hex_number of address is negative
2021-12-03 15:13:54 +08:00
JanLJL
3404d72dc5
enhanced parser
2021-09-29 17:26:27 +02:00
JanLJL
1b40c10a1f
applied flake8 and black rules
2021-08-26 16:58:19 +02:00
JanLJL
ec771dbe91
fixed parsing of reg ranges and lists
2021-06-01 00:10:05 +02:00
JanLJL
5038ce7a15
changed immediate type from str to int
2021-05-10 01:12:30 +02:00
JanLJL
30d6f4d737
fix #73
2021-05-02 22:22:30 +02:00
Julian Hammer
781b8b6b89
improved register range and list support on AArch64
2021-04-23 13:12:18 +02:00
Julian
04836cf3f9
Validation (#71)
Validation of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.
build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4; Analysis.ipynb contains the analysis script and results. Raw data from measurements (122 MB) will be attached to the next OSACA release.
For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz
The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb
Quite a few changes to OSACA are included:
Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representation
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
afa607e546
migrate code style to Black
2021-03-11 12:02:45 +01:00
Julian Hammer
c788e7b97b
passing parsing errors to the outside
2021-03-05 18:07:36 +01:00
JanLJL
70ddce85c9
fixed AArch64 parser for register shifts and new instructions for A64FX
2021-02-25 07:43:42 +01:00
Jan
ffcdee5ed9
suppress branch indication in mnemonic for now
2021-01-07 10:20:57 +01:00
jdomke
eb1a8a2674
att parser: support for more mnemonics
found some 'jg,pt' in icc/mkl generated binaries which crashed the
parser, here an example:
dd8ccd: 3e 7f 90 jg,pt dd8c60 <mkl_blas_avx2_dtrsm_kernel...
2021-01-05 15:26:22 +09:00
JanLJL
7b83e4a370
fixes #61
2020-12-28 10:33:54 +01:00
JanLJL
407c1a2beb
add support for :: delimiter in x86 labels and identifiers
2020-12-07 02:45:49 +01:00
JanLJL
9a6866417e
added support for absolute address operands without brackets
2020-12-07 01:57:06 +01:00
JanLJL
c783b2ce2d
enhancements for lookup and parsing AArch64 instrs
2020-12-07 01:18:32 +01:00
JanLJL
40afe51454
added the possibility of a 5th operand
2020-12-06 18:05:59 +01:00
JanLJL
0ca6e12e87
bugfixes
2020-11-21 21:00:58 +01:00
JanLJL
e643520a79
replaced 582b46b3 with smarter solution
2020-11-20 15:44:49 +01:00
JanLJL
582b46b347
bugfix for immediate shifts with hex-base
2020-11-20 15:22:05 +01:00
JanLJL
6b0673d012
bugfixes
2020-11-09 23:29:42 +01:00
Julian Hammer
7f8b596fc9
improved performance of arch_semantics and reg dependency matching
2020-11-09 19:27:47 +01:00
Julian Hammer
0e4d5a6692
singleton for isa parsers
2020-11-09 12:36:14 +01:00
JanLJL
aa6252f270
bugfixes
2020-11-06 12:03:54 +01:00
JanLJL
36cf3be579
try different ISA as fallback when parsing without --arch flag, use SKX as x86 default and enhanced ISA detection heuristic
2020-11-02 15:33:50 +01:00
JanLJL
bfd07a83a4
autodetect ISA and default uarch for ISA
2020-10-29 13:00:02 +01:00
JanLJL
22e36f4870
enhanced handling for immediates with shifting
2020-10-21 12:14:21 +02:00
JanLJL
537772e223
refactored operand checking in post-processing
2020-10-16 10:05:08 +02:00
JanLJL
2df4b353ed
no \t replacement before any other point than user output
2020-10-16 09:44:18 +02:00
JanLJL
633e93ee4b
Merge branch 'master' into A64FX
2020-10-15 22:44:12 +02:00
JanLJL
fb0ba144a5
adjusted tests for AArch64
2020-10-15 17:56:08 +02:00
Julian Hammer
dff2e27199
fixed label identifiers by splitting
2020-10-12 15:39:32 +02:00
Julian Hammer
9627686166
parentheses now supported in identifier strings
2020-10-12 15:05:52 +02:00
JanLJL
28cdcab05f
enhanced ARM identifier to support immediate offsets
2020-09-17 22:12:12 +02:00
JanLJL
3bea6c6add
fixed first character match for symbolic identifiers
2020-08-03 18:30:29 +02:00
Jan
3e8a89ffad
Merge pull request #49 from RRZE-HPC/coherent_label_parsing
Coherent label parsing
2020-08-03 18:25:20 +02:00
JanLJL
20d8e2b217
save b/f in numeric identifier as suffix tag
2020-08-03 18:08:29 +02:00
JanLJL
6c1424e2f7
removed unnecessary code
2020-08-03 17:14:58 +02:00
JanLJL
a75405b77b
detects numeric label as label
2020-08-03 16:59:48 +02:00
Julian Hammer
da1d39e8e6
labels may now start with numbers
2020-08-03 15:53:29 +02:00
JanLJL
fa3056f364
minor enhancement for mask parsing
2020-08-03 09:07:45 +02:00
JanLJL
4fe299df2d
enhancements for SVE support
2020-08-03 08:54:59 +02:00