Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Lower neon_vmaxvq_u8,neon_vmaxvq_s8, neon_vmaxv_u8 and neon_vmaxvq_s8 #1265

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

ghehg
Copy link
Collaborator

@ghehg ghehg commented Jan 3, 2025

Neon definiton
OG implementation
Implementation in this PR is different from OG as

  1. avoided code duplication by extracting out the common pattern
  2. avoided using i32 as return type of the intrinsic call, so eliminated the need for casting result of the intrinsic call. This way of OG's implementation is quite unnecessary IMHO, this is MAX, not ADD or MUL. After all, using the expected type as return type of intrinsic call produces the same ASM code.

@ghehg ghehg marked this pull request as ready for review January 3, 2025 15:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant