October 22-23, 2024
Santa Clara, CA

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for RISC-V Summit to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.
Expo Hall - Exhibit Hall A (Level 1)
Wednesday, October 23
 

2:00pm PDT

Career Day
Wednesday October 23, 2024 2:00pm - 5:00pm PDT
Attendees are encouraged to connect with industry leaders and drop off their resumes at exhibitors' booths, while also attending the last set of keynotes from RISC-V Summit.

*Separate registration required.
Expo Hall - Exhibit Hall A (Level 1)

3:15pm PDT

AI/ML Poster Sessions
Wednesday October 23, 2024 3:15pm - 3:55pm PDT
High Performance and Efficiency 512-B & 1024-B VLEN Vector Processor and AI Related Accelerator - Nathan Ma, Nuclei System Technology
In this presentation, we delve into the synergy between RISC-V vector processing, with a spotlight on the RVV 1.0 extension (specifically VLEN = 512 and 1024 bits), and AI acceleration. RISC-V becomes even more impactful with the introduction of the RVV 1.0 extension, which is designed to elevate vector processing capabilities. In 2024, we released our Intelligence Class Core IP Series, which focuses on AI and other applications that require intensive parallel vector computing capability.

Enhancing RISC-V ISA to Support Sub-FP8 Quantization for Machine Learning Models - Mengshiun Yu & Jhih-Kuan Lin, National Tsing Hua University
In this session, we present research that proposes extending the RISC-V Instruction Set Architecture (ISA) to support sub-FP8 quantized data formats, optimizing AI and machine learning models for low-power edge devices. The study develops new instructions that enable a RISC-V CPU core to handle data types below FP8, such as 6-bit and 4-bit formats. These improvements enhance AI workload performance and energy efficiency, allowing complex machine learning tasks to be performed locally on edge devices such as smartphones, IoT devices, and wearables. The proposed ISA extension supports mixed-precision workloads and ensures backward compatibility with existing hardware for easy adoption. The research includes designing a new sub-FP8 extension with computational, configuration, load/store, and conversion instructions. The design is demonstrated with two assembly-code examples: one that adds two FP8 (E5M2) values and another that performs a saxpy computation with the vector extension.
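As a rough software illustration of the two examples named in this abstract (adding two FP8 E5M2 values, and saxpy), the Python sketch below emulates the arithmetic on the host. It does not show the proposed instructions, which are not given in the abstract. It assumes the common E5M2 layout of 1 sign bit, 5 exponent bits, and 2 mantissa bits, which matches the upper byte of an IEEE 754 binary16 value. All function names are hypothetical.

```python
import struct


def e5m2_to_float(byte: int) -> float:
    """Decode one E5M2 byte by using it as the high byte of a binary16."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a value in 0..255")
    # Little-endian half-precision: low byte is zero, high byte is the FP8 bits.
    return struct.unpack("<e", bytes([0x00, byte]))[0]


def float_to_e5m2(value: float) -> int:
    """Encode by converting to binary16 and keeping only its high byte.

    Assumption for illustration: the low mantissa bits are simply dropped.
    Values too large for binary16 make struct.pack raise OverflowError.
    """
    return struct.pack("<e", value)[1]


def e5m2_add(a: int, b: int) -> int:
    """Add two E5M2-encoded values and return the E5M2-encoded result."""
    return float_to_e5m2(e5m2_to_float(a) + e5m2_to_float(b))


def saxpy_e5m2(alpha: float, xs: list, ys: list) -> list:
    """Compute alpha * x + y element-wise over E5M2-encoded lists."""
    return [
        float_to_e5m2(alpha * e5m2_to_float(x) + e5m2_to_float(y))
        for x, y in zip(xs, ys)
    ]


if __name__ == "__main__":
    one = 0x3C  # 1.0 in E5M2
    print(hex(e5m2_add(one, one)))            # 1.0 + 1.0
    print(saxpy_e5m2(2.0, [one], [one]))      # 2.0 * 1.0 + 1.0
```

Doing the arithmetic in double precision and narrowing once at the end is a simplification; a hardware implementation would define its own rounding and saturation behavior.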

Towards Generative AI for RISC-V Verification - Sergei Chirkunov, Imagination Technologies
Generative AI has considerable potential in CPU verification. In this work, we adapt networks and techniques developed in the context of large language models (LLMs) for natural language processing to RISC-V assembly sequences to facilitate future applications to CPU verification. In particular, we demonstrate the ability to generate novel assembly sequences of guaranteed-valid instructions with a small, efficient language model. We anticipate that our work will ultimately facilitate a variety of verification tasks such as stimulus generation, assessment of the similarity between sequences, and identification of minimal test batteries that exercise the state space.

The Efficient Way to Design a RISC-V Edge AI Processor with Software Hardware Co-Design Methodology - Meng Zhang, Terapines Technology (Wuhan) Co., Ltd 
This talk shows how to improve the performance of an AI model running on a virtualized RISC-V architecture using a software-hardware co-design methodology. With this method, one person can go from micro-architecture design, through adding support for customized instructions in the compiler, debugger, and simulator, to profiling AI model performance on a virtualized platform in as little as a few hours. No knowledge of how to customize a compiler, debugger, or simulator is needed, because our software-hardware co-design flow does all of that automatically.

Creating Custom RISC-V Processors Using ASIP Design Tools: A Neural Network Acceleration Case Study - Gert Goossens, Synopsys
The AI revolution has triggered increased awareness of application-specific instruction-set processors (ASIPs). A RISC-V architecture can be extended with specialized datapaths, storage, and custom instructions to accelerate AI workloads. New instructions can be encoded in RISC-V's reserved opcode space or in additional parallel slots of an extended long instruction word. Notwithstanding the specialization, compatibility with and reuse of the RISC-V ecosystem is maintained.
Synopsys’ ASIP Designer tool-suite enables the design of custom RISC-V processors. Starting from a formal ISA model, it assists designers in selecting ISA extensions, generates an SDK with an optimizing compiler supporting the extensions, and produces an efficient RTL implementation.
We illustrate this approach with the design of a custom RISC-V processor to accelerate convolutional neural network algorithms for edge AI, with programming support for TensorFlow Lite for Microcontrollers (TFLM). ISA specialization includes the introduction of 4-lane SIMD with a local vector memory, 4 specialized convolution units with 16 multipliers each, dedicated accumulator registers, and 2-way instruction-level parallelism.

Towards an Integrated Matrix Extension: Workload Analysis of CNN Inference with QEMU TCG Plugins - Matheus Ferst, Instituto de Pesquisas ELDORADO
Following the gap analysis done in the second half of 2023, the SIG-Vector has been working on specifying instructions to accelerate matrix operations. Two Task Groups were proposed to explore different approaches. The "Attached Matrix Extension" (AME) is working on a set of instructions independent of other extensions and requires new registers to hold matrix data. The Integrated Matrix Extension (IME) proposes the reuse of the Vector Registers introduced by the V extension. The AME solution is similar to how other architectures added matrix operations, like Intel's AMX and ARM's SME, while the IME proposal resembles how the POWER architecture added matrix operations. The IME might also help applications that interleave matrix and vector operations by avoiding data movement between different types of registers.
To verify how commonly that happens on AI/ML workloads, we developed a QEMU TCG Plugin to instrument the inference of eight CNN models optimized to use the IME-like POWER10 matrix instructions. The results also show some types of vector operations that interact with matrix data and would be helpful in an AME implementation to avoid sending data back to memory.


Enhancing the Future of AI/ML with Attached Matrix Extension - Jing Qiu, Alibaba
We've now updated Xuantie Attached Matrix Extension ISA to keep pace with rapid advances in AI.
The new matrix ISA uses 64-bit instructions. These self-contained long instructions can support more architectural registers, facilitate sparse operations, include longer immediates and more metadata. This enhanced encoding scheme increases both the flexibility and efficiency of matrix computations. Another enhancement is the introduction of structured sparsity techniques that allow for variable sparsity ratios (N:M sparsity) across k dimensions. The new extension also supports innovative data types, such as int4/fp8, commonly used in large language models. In addition to multi-precision, it also supports mixed-precision operations. Har
Speakers

Jing Qiu

technology expert, Alibaba
QiuJing is a technology expert in the CPU R&D department at Alibaba. His current work focuses on the design and specification of the matrix-related and AI domain-specific architecture of the Xuantie processors. QiuJing received his Ph.D. in Circuit and System from Zhejiang University...

Gert Goossens

Executive Director of Engineering, Synopsys
Gert Goossens is an Executive Director of Engineering at Synopsys, where he is currently leading the company’s tool development group for Application-Specific Instruction-set Processors (ASIPs). Previously, he was a co-founder and the CEO of Target Compiler Technologies, the company...

Nathan Ma

Senior Director of Strategy and Business Development, Nuclei System Technology
Nathan Ma started his career at Marvell and SiFive before joining Nuclei as Senior Director of Strategy and Business Development. Nathan now manages Nuclei's fundraising, technical marketing, and global business development.

Jhih-Kuan Lin

graduate student, National Tsing Hua University
Jhih-Kuan Lin is a dedicated graduate student at the Parallel and Distributed Systems Laboratory (PLLAB) in the Department of Computer Science at National Tsing Hua University (NTHU). Jhih-Kuan Lin's research focuses on the cutting-edge development and optimization of the RISC-V...

Mengshiun Yu

Ph.D. candidate, Department of Computer Science at National Tsing Hua University, Taiwan
Meng-Shiun Yu is currently a Ph.D. candidate in the Department of Computer Science at National Tsing Hua University, Taiwan. His research interests include compiler optimization for deep neural networks and computer vision, and compiler construction for hardware accelerators. Currently...

Sergei Chirkunov

Research Engineer, Imagination Technologies
Sergei has several years of research experience in the semiconductor IP industry. His main research interests include applied AI (primarily language modelling and graphics), computer architecture, and RISC-V verification tooling.

Meng Zhang

Software Engineer, Terapines Technology (Wuhan) Co., Ltd
Software Engineer at Terapines Technology (Wuhan) Co., Ltd

Matheus Ferst

Software Developer, Instituto de Pesquisas ELDORADO
Matheus is a software developer at the Embedded Computing Department of Instituto de Pesquisas Eldorado. He graduated in Computer Engineering at Universidade Tecnológica Federal do Paraná and holds a Master's in Electrical Engineering from the same institution. He is also an open-source...
Expo Hall - Exhibit Hall A (Level 1)

3:15pm PDT

Automotive, Embedded & Mobile Poster Sessions
Wednesday October 23, 2024 3:15pm - 3:55pm PDT
Optimizing Image Signal Processing with RISC-V FPGA - Umer Imran & Bilal Zafar, 10xEngineers
In this session, we will explore the successful implementation of Infinite-ISP, a comprehensive Image Signal Processor (ISP) development platform, on an Efinix FPGA leveraging a RISC-V core. Infinite-ISP provides a full-stack solution, from algorithm development to RTL design, FPGA/ASIC implementation, and associated firmware and tools, creating a unified platform that accelerates ISP development. Our case study will delve into the technical details of integrating Infinite-ISP with a RISC-V based FPGA, highlighting the challenges faced and the innovative solutions devised to overcome them. Attendees will learn about the performance benchmarks achieved and the significant enhancements in efficiency and scalability. Additionally, we will discuss the broader implications of using an open-source RISC-V architecture in specialized applications like ISP development. Join us to discover how leveraging RISC-V for ISP development can open new possibilities in image processing technology. This presentation is ideal for engineers, developers, and decision-makers interested in the cutting-edge intersection of RISC-V and image signal processing.

Longnail: Hardware Synthesis of CoreDSL Custom Instructions for MCU- and Application-Class Cores - Tammo Mürmann & Florian Meisel, Technical University of Darmstadt
Custom instruction set architecture extensions (ISAX) are an energy-efficient and cost-effective way to accelerate modern workloads. However, exploring different combinations of base cores and ISAXes for a specific application requires automation and a level of portability across microarchitectures not provided by existing approaches.
To that end, we present an end-to-end flow for ISAX specification, generation, and integration into a number of host cores with a range of different microarchitectures. For ISAX specification, we leverage CoreDSL, an open-source C-like behavioral architecture description language. Hardware generation is handled by Longnail, a domain-specific high-level synthesis tool that compiles CoreDSL specifications into hardware modules compatible with the open-source SCAIE-V extension interface, which we use for automatic integration into the host cores.
We demonstrate our tooling by generating ISAXes using a mix of features, including complex multi-cycle computations, memory accesses, branch instructions, custom registers, and decoupled execution across five MCUs and two application-class cores, and evaluate the quality of results on a 22nm ASIC process.

RISC-V & Its Role in Silicon Lifecycle Management - Vivek Chickermane, Siemens EDA
This session will focus on the use of RISC-V processors and the RISC-V Trace specification in safety critical applications and the ability to implement embedded solutions that serve as a foundation for a comprehensive SoC Silicon debug and continuous monitoring system.

Introduction of Deploying the Rv64ilp32 ABI on the Kendryte K230d for Productization - Ren Guo, Alibaba XuanTie
Over the past year, the Alibaba XuanTie and PLCT teams have been dedicated to promoting the rv64ilp32 ABI, as it effectively addresses the need to run ILP32 software on existing RVA Profiles. Unlike before, the rv64ilp32 ABI steers clear of the Linux userspace scenario, focusing instead on underlying software such as the Linux kernel, RTOS, firmware, and hypervisors. We completed the first productized SDK based on the rv64ilp32 ABI on Canaan's k230d chip, enabling rv64ilp32 NuttX and Linux. The k230d is Canaan Kendryte's new product, a repackaged chip based on the k230 that incorporates 128MB of internal memory to reduce costs. Thus, there is a strong demand for the ILP32 ABI. This presentation will demonstrate the advantages of rv64ilp32 through actual test data on the k230d EVB: it avoids a 30% waste of memory footprint and significantly improves the performance of Linux linked list traversal. We introduced sign-extend addressing to replace the traditional zero-extend addressing. The newer XuanTie processors support a new relaxed-extend addressing mode to gain more performance. Finally, the presentation will share progress and plans for the rv64ilp32 ABI on embedded hypervisors.
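To make the difference between the two addressing rules concrete, the Python sketch below widens a 32-bit pointer value to 64 bits both ways. It only illustrates the arithmetic of zero extension versus sign extension. It does not describe how the rv64ilp32 ABI or XuanTie hardware implements addressing, and it does not cover the relaxed-extend mode, whose definition is not given in this abstract. The function names are hypothetical.

```python
MASK32 = 0xFFFF_FFFF
HIGH32 = 0xFFFF_FFFF_0000_0000


def zero_extend32(ptr32: int) -> int:
    """Widen a 32-bit value to 64 bits by filling the upper half with zeros."""
    return ptr32 & MASK32


def sign_extend32(ptr32: int) -> int:
    """Widen a 32-bit value to 64 bits by copying bit 31 into the upper half."""
    low = ptr32 & MASK32
    if low & 0x8000_0000:
        return low | HIGH32
    return low


if __name__ == "__main__":
    for ptr in (0x7FFF_F000, 0x8000_0000):
        print(f"{ptr:#010x}: zero={zero_extend32(ptr):#018x} "
              f"sign={sign_extend32(ptr):#018x}")
```

The two rules agree for values with bit 31 clear and differ only when bit 31 is set, in which case sign extension maps the value into the top of the 64-bit address space.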


Speakers

Bilal Zafar

Founder, 10xEngineers
Looking for an engineering outsourcing solutions provider who is a strategic partner rather than a mere service provider to ease your engineering resource challenges? 10x engineers is the right choice for you. Our RISC-V DV teams are led by experienced industry veterans ("10x" engineers...

Ren Guo

Staff Engineer, Alibaba
A Linux kernel developer focusing on the CPU subsystem, including virtualization, IOMMU, and PCIe. Currently dedicated to running ILP32 on the RISC-V 64-bit ISA.

Umer Imran

Sr. Design Verification Engineer, 10xEngineers
Umer Imran is a Manager/Senior Engineer with over 4 years of experience specializing in Core and SoC Verification. His career is marked by a series of achievements, including successful verification planning, robust test bench development, extensive coverage analysis, code and functional...

Tammo Mürmann

Technical University of Darmstadt
Tammo Mürmann has just commenced his PhD studies at the Technical University of Darmstadt as part of the Embedded Systems and Applications Group (ESA). During his studies, he already participated in the development of a high-level synthesis compiler (Longnail) that was recently presented...

Florian Meisel

Technical University of Darmstadt
Florian Meisel is a PhD candidate at Technical University of Darmstadt and part of the Embedded Systems and Applications Group (ESA). As part of his studies, he has worked on the design and integration of a security tracing interface into a range of RISC-V cores (RT-LIFE) and its...

Vivek Chickermane

Senior Director, Siemens EDA
Dr Vivek Chickermane is a Senior Director for Embedded Analytics SW R&D at Siemens EDA. He has over 25 years of R&D experience at Siemens, Cadence, and IBM in the areas of Design-for-Test, Logic Synthesis and Silicon Lifecycle Management. Dr Chickermane is an Associate Editor of IEEE...
Expo Hall - Exhibit Hall A (Level 1)
 