ISSCC 2025 PRESS KIT PDF Free Download

Name: ISSCC 2025 PRESS KIT PDF
Author: jgomez

1 / 152

5 views•152 pages

ISSCC 2025 PRESS KIT PDF Free Download

ISSCC 2025 PRESS KIT PDF free Download. Think more deeply and widely.

2025

PRESS KIT

DRAFT 2-1-2025

ISSCC Press Kit Disclaimer

The material presented here is preliminary.

As of November 5, 2024, there is not enough information to guarantee its correctness.

Thus, it must be used with some caution.

ISSCC 2025

VISION STATEMENT

The International Solid-State Circuits Conference is the foremost global forum for presentation

of advances in solid-state circuits and systems-on-a-chip. The Conference offers a unique

opportunity for engineers working at the cutting edge of IC design and use to maintain

technical currency, and to network with leading experts.

© COPYRIGHT 2025 ISSCC—DO NOT REPRODUCE WITHOUT PERMISSION 
Table of Contents 
Table of Contents ............................................................................................................................................ 4 
Preamble .......................................................................................................................................................... 7 
FAQ on ISSCC ...................................................................................................................................................................................... 7 
Plenary Session (Session 1) ............................................................................................................................................................. 10 
Plenary Session — Invited Papers ............................................................................................................... 12 
Plenary Session — Invited Papers ............................................................................................................... 13 
Evening Events (EE) ...................................................................................................................................... 15 
Bingo Networking Event ................................................................................................................................................................... 15 
EE1: Student Research Preview ....................................................................................................................................................... 15 
EE2: Quantum Computing: Whose Qubit is Better? ....................................................................................................................... 15 
EE3: Future of Analog Design: Still Magical or Mostly Digital? .................................................................................................... 16 
EE4: The Next Decade of AI – Barriers, Opportunities, & Directions ............................................................................................ 16 
Session Overviews and Highlights .............................................................................................................. 17 
Conditions of Publication ............................................................................................................................. 18 
PREAMBLE ......................................................................................................................................................................................... 18 
FOOTNOTE ......................................................................................................................................................................................... 18 
Session 2 Overview: Processors ................................................................................................................. 19 
Session 2 Highlights: Processors ................................................................................................................ 20 
Session 3 Overview: Amplifiers and Analog Front-Ends............................................................................ 22 
Session 3 Highlights: Amplifiers and Analog Front-Ends .......................................................................... 23 
Session 4 Overview: Analog Techniques .................................................................................................... 24 
Session 4 Highlights: Analog Techniques ................................................................................................... 25 
Session 5 Overview:  Front-End Circuits for High-Performance Transceivers ......................................... 26 
Session 5 Highlights:  Front-End Circuits for High-Performance Transceivers ....................................... 27 
Session 6 Overview: Imagers and Displays ................................................................................................ 28 
Session 6 Highlights: Imagers and Displays ............................................................................................... 29 
Session 7 Overview: Ultra-High-Speed Wireline ......................................................................................... 31 
Session 7 Highlights: Ultra-High-Speed Wireline ........................................................................................ 32 
Session 8 Overview: Digital Techniques for System Adaptation, Power Management and Clocking ..... 34 
Session 8 Highlights: Digital Techniques for System Adaptation, Power Management and Clocking ... 35 
Session 9 Overview: Ubiquitous Power Delivery ........................................................................................ 36 
Session 9 Highlights: Ubiquitous Power Delivery ...................................................................................... 37 
Session 10 Overview: Transceiver Chipsets for Communication and Radar ............................................ 38 
Session 10 Highlights: Transceiver Chipsets for Communication and Radar .......................................... 39 
Session 11 Overview:  RF and mm-Wave Wireless Receivers ................................................................... 40 
Session 11 Highlights:  RF and mm-Wave Wireless Receivers ................................................................. 41 
Session 12 Overview:  Innovations from Outside the (ISSCC) Box ........................................................... 42 

© COPYRIGHT 2025 ISSCC—DO NOT REPRODUCE WITHOUT PERMISSION 
Session 12 Highlights: .................................................................................................................................. 42 
Session 13 Overview: Cool Computation Circuits ...................................................................................... 44 
Session 13 Highlights: Cool Computation Circuits..................................................................................... 45 
Session 14 Overview: Compute-In-Memory ................................................................................................ 47 
Session 14 Highlights: Compute-in-Memory ............................................................................................... 48 
Session 15 Overview: Neural Interfaces and Edge Intelligence for Medical Devices ............................... 50 
Session 15 Highlights: Precision neural and cardiac chips ....................................................................... 51 
Session 16 Overview: Highlighted Chip Releases:  Digital and Machine Learning Processors .............. 53 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 54 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 55 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 56 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 57 
Session 17 Overview: Hardware Security .................................................................................................... 58 
Session 17 Highlights: Hardware Security .................................................................................................. 59 
Session 18 Overview:  Noise-Shaping and SAR-Based ADCs ................................................................... 60 
Session 18 Highlights:  Noise-Shaping and SAR-Based ADCs ................................................................. 61 
Session 19 Overview:  Frequency Synthesizers and Series-Resonance VCOs ........................................ 62 
Session 19 Highlights:  Frequency Synthesizers and Series-Resonance VCOs ...................................... 63 
Session 20 Overview:  Sensors and Actuators for Health and Autonomy ................................................ 64 
Session 20 Highlights:  Sensors and Actuators for Health and Autonomy............................................... 65 
Session 21 Overview: Compute & USB Power ............................................................................................ 66 
Session 21 Highlights: Compute & USB Power .......................................................................................... 67 
Session 22 Overview: Memory Interface ...................................................................................................... 68 
Session 22 Highlights: Memory Interface .................................................................................................... 69 
Session 23 Overview: AI Accelerators ......................................................................................................... 70 
Session 23 Highlights: AI Accelerators ....................................................................................................... 71 
Session 24 Overview: High-Frequency ADCs ............................................................................................. 73 
Session 24 Highlights: High-Frequency ADCs ............................................................................................ 74 
Session 25 Overview: High Concepts at High Frequencies ....................................................................... 75 
Session 25 Highlights: High Concepts at High Frequencies ...................................................................... 76 
Session 26 Overview:  Wireless Transmitters and Front-Ends .................................................................. 77 
Session 26 Highlights:  Wireless Transmitters and Front-Ends ................................................................ 78 
Session 27 Overview: Sensor Interfaces ..................................................................................................... 79 
Session 27 Highlights: Scalable Sensor Interface ...................................................................................... 80 
Session 28 Overview: Capacitive Sensor Readout ..................................................................................... 81 
Session 28 Highlights: Capacitive Displacement Sensor ........................................................................... 82 
Session 29 Overview: SRAM ........................................................................................................................ 83 
Session 29 Highlights: SRAM ....................................................................................................................... 84 
Session 30 Overview: Nonvolatile Memory and DRAM .............................................................................. 85 
Session 30 Highlights: Nonvolatile Memory and DRAM ............................................................................. 86 

© COPYRIGHT 2025 ISSCC—DO NOT REPRODUCE WITHOUT PERMISSION 
Session 31 Overview: Energy Harvesting and IoT Power .......................................................................... 88 
Session 31 Highlights: Energy Harvesting and IoT Power ......................................................................... 89 
Session 32 Overview: Isolated Power and Gate Drivers ............................................................................. 90 
Session 32 Highlights: Isolated Power and Gate Drivers ........................................................................... 91 
Session 33 Overview: Components for Beyond 100GHz ............................................................................ 92 
Session 33 Highlights: Components for Beyond 100GHz .......................................................................... 93 
Session 34 Overview:  Digital PLLs and Waveform-Shaping VCOs .......................................................... 94 
Session 34 Highlights:  Digital PLLs and Waveform-Shaping VCOs ......................................................... 95 
Session 35 Overview: Implantable and Wearable Biomedical Devices ..................................................... 96 
Session 35 Highlights: Photoacoustic/Implant RXs.................................................................................... 97 
Session 36 Overview: Ultra-High-Density D2D  and High-Performance Optical Transceivers ................ 98 
Session 36 Highlights: Ultra-High-Density D2D  and High-Performance Optical Transceivers ............... 99 
Session 37 Overview: Design-Technology Optimization and Digital Accelerators ................................ 101 
Session 37 Highlights: Design-Technology Optimization and Digital Accelerators ............................... 101 
Trends .......................................................................................................................................................... 103 
Conditions of Publication ........................................................................................................................... 104 
PREAMBLE ....................................................................................................................................................................................... 104 
FOOTNOTE ....................................................................................................................................................................................... 104 
Analog – 2025 Trends.................................................................................................................................. 106 
Power Management – 2025 Trends ............................................................................................................ 108 
Data Converters – 2025 Trends .................................................................................................................. 110 
RF – 2025 Trends ......................................................................................................................................... 115 
Wireless – 2025 Trends ............................................................................................................................... 118 
Wireline – 2025 Trends ................................................................................................................................ 121 
Digital Architectures & Systems (DAS) – 2025 Trends ............................................................................. 127 
Digital Circuits – 2025 Trends ..................................................................................................................... 131 
Memory – 2025 Trends ................................................................................................................................ 133 
Security – 2025 Trends ................................................................................................................................ 137 
IMD – 2025 Trends (Medical) ....................................................................................................................... 139 
IMD – 2025 Trends (Imagers) ...................................................................................................................... 140 
IMD – 2025 Trends (Displays) ..................................................................................................................... 141 
Technology Directions – 2025 Trends ....................................................................................................... 142 
INDEX ........................................................................................................................................................... 143 
Technical Topics Mapped to Papers .................................................................................................................................................. 144 
Selected Presenting Companies/Institution Mapped to Papers ......................................................................................................... 144 
Contact Information .................................................................................................................................... 149 
 
  

Preamble

FAQ on ISSCC

What is ISSCC?

ISSCC (International Solid-State Circuits Conference) is the flagship conference of the IEEE Solid-State Circuits Society. According to

the Semiconductor Industry Association (SIA), in 2023, despite a cyclical market downturn that lingered early in the year, global sales

rebounded the second half of the year to reach $527 billion. Nearly 1 trillion semiconductors were sold globally, more than 100 chips for

every person on earth. With the downturn now over and demand for semiconductors high, industry analysts project double-digit annual

growth in the upcoming years. According to the World Semiconductor Trade Statistics (WSTS) Spring 2024 Semiconductor Industry

Forecast, a 16.0 percent growth is expected in the global semiconductor market compared to the previous year. The updated market

valuation for 2024 is estimated at US$611 billion. Looking ahead to 2025, WSTS forecasts a 12.5 percent growth in the global

semiconductor market, reaching an estimated valuation of US$687 billion. ISSCC continues to be the premier technical forum for

presenting advances in solid-state circuits and systems whose global market is increasing rapidly.

Who Attends ISSCC?

Attendance at ISSCC 2025 is expected to be around 2800. Corporate attendees from the semiconductor and system industries

typically represent around 60%.

Where is ISSCC?

The 72nd ISSCC will be held in-person with some online offering from February 16th through February 20th, 2025.

Are there Keynote Speakers?

After a day devoted to educational events, ISSCC 2025 begins formally on Monday, February 17, 2024, with four exciting plenary talks:

• Navid Shahriari, Senior Vice President, Foundry Technology Development, Intel

• Daniela Rus, Professor, EECS, Massachusetts Institute of Technology

• Jaihyuk Song, Corporate President & CTO, Device Solutions, Samsung Electronics

• Peter Schiefer, President/CEO, Infineon Technologies

What is the Technical Coverage at ISSCC?

ISSCC covers a full spectrum of design approaches in advanced technical areas broadly categorized as: (1) Communication Systems,

(2) Analog Systems, (3) Digital Systems, and (4) Innovations including micro-machines and MEMS, imagers, sensors, biomedical

devices, as well as forward-looking developments that may take three or more years for commercialization.

How are ISSCC Papers Selected?

For ISSCC 2025 around 914 submissions were received across the broad spectrum of specified topics. The submissions were reviewed

by a team of over 190 scientific and industry experts from the Far-East, Europe, and North America. These experts are organized into

12 Sub-Committees that cover the 4 broad areas described earlier:

• Communication Systems includes Wireless, RF, and Wireline Subcommittees

• Analog Systems includes Analog, Power Management, and Data Converter Subcommittees

• Digital Systems includes Memory, Digital Circuits, Digital Architectures and Systems, and Security Subcommittees

• Innovative Topics includes Imagers/MEMS/Medical Devices/Displays and Technology Directions Subcommittees

What Companies are Presenting this year?

Companies presenting papers at ISSCC 2025 include:

AMD, Analog Devices, Broadcom, Baker Hughes, Cadence, Co-Sensing, d-Matrix, IBM, Infineon Technologies, Intel, Kandou Bus,

KIOXIA, Ludwig Computing, Marvell, MediaTek, Meta, Novelda, Nvidia, SambaNova, Samsung, SKhynix, Semiconductor, Realtek

Semiconductor, Sony, Synopsos, Teledyne DALSA, Texas Instruments, TSMC, Vango, and Western Digital, just to name a few.

A more complete list can be found in the Index.

Are there educational sessions?

ISSCC features a variety of educational events which include:

• Ten Tutorials (targeted toward participants looking to broaden their horizon)

• Six Forums (targeted toward experts in an information sharing context)

• One Short Course (targeted toward in-depth appreciation of a current hot topic)

Are There Other Events?

A more complete list of all activities at ISSCC 2025:

• Four Plenary Presentations

• Eight Invited Talks, consisting of four Industry Talks on Highlighted Chip Releases

and four Talks on Innovative Out-of-the-box Ideas.

• Technical Sessions (37 distinct sessions)

• Four Evening Events and Panels, including:

o Student Research Preview (for the introduction of graduate-student research-in-progress)

o Quantum Computing: Whose Qubit is Better?

o Future of Analog Design: Still Magical or Mostly Digital?

o The Next Decade of AI – Barriers, Opportunities, & Directions

• Educational Sessions Featuring:

o Ten Tutorials

o Six Forums

o One Short Course

• Demonstration Sessions from Academia and Industry

• Exhibition (Corporation & Research Institutions)

• Networking Events

• Author Interview Sessions

• A Number of University Alumni Events

How Do I Use this Press Kit?

The Press Kit provides a PREAMBLE section that features this FAQ and other general information. The kit also includes SESSION

OVERVIEWS AND HIGHLIGHTS of all 37 technical sessions into which the 255 papers are grouped, together with brief descriptions and

context for each. As well, there is an abstract for each of the Plenary talks. For your convenience, the Kit includes two structural charts

in the INDEX section: (a) a list of the 4 Technical Topics and their associated Subcommittees and Sessions; (b) a list of contributing

companies and institutions with their associated papers. Thus, to locate information of interest you can access Chart 4.1 to identify

sessions of interest, after which you might logically access its Session’s Overview or Highlight section. Alternatively, if your interest is in

a particular organization then Chart 4.1 will direct you immediately to papers of interest each of which is detailed in its corresponding

Session Overview and possibly in the Highlights section. For anyone’s interest it is useful to use Chart 4.1 to access the appropriate

Trends information which provides a broad historical view of the context of your interest and often includes reference to current ISSCC

2025 papers.

Anything New This Year?

ISSCC will hold an invited Industry Track (Session 16) which will highlight recent hot-product releases from Broadcom, FuriosaAI,

Samsung Electronics, and SambaNova Systems, and will discuss innovative ways they solved product-level challenges. ISSCC will also

hold an invited session (Session 12) that will highlight some of the new and important developments outside of the circuit community that

can have profound impact on the solid-state circuits society either through emergence of new applications or via offering of new platforms

for processing, communications, and sensing.

Overview: ISSCC 2025 – “The Silicon Engine Driving the AI Revolution”

Now in its 72nd year, ISSCC is the flagship conference in solid-state circuit design. ISSCC reports innovative ideas that will fuel the

continued rapid evolution of integrated circuits (ICs). The IC design community has proven to be agile and resilient in a world scarred by

the worldwide pandemic, and for the 72nd edition of ISSCC we are focusing, more than ever, on how IC innovation can drive and support

the AI revolution.

• .

Plenary Session (Session 1)

The Plenary Session on the morning of February 17, 2025, will feature four renowned speakers:

• Navid Shahriari, Senior Vice President, Foundry Technology Development, Intel

• Daniela Rus, Professor, EECS, Massachusetts Institute of Technology

• Jaihyuk Song, Corporate President & CTO, Device Solutions, Samsung Electronics

• Peter Schiefer, President/CEO, Infineon Technologies

Highlights of these Plenary talks are provided in the following section.

ISSCC 2025

PLENARY SESSION – INVITED PAPERS

Plenary Session — Invited Papers

Chair: Edith Beigné, Meta, Menlo Park, CA

ISSCC Conference Chair

Associate Chair: Thomas Burd, Advanced Micro Devices, Santa Clara, CA

ISSCC International Technical-Program Chair

1.1 AI Era Innovation Matrix

Navid Shahriari, Senior Vice President, Foundry Technology Development, Intel, Chandler, AZ

AI holds transformative potential for humanity, enhancing our ability to solve complex problems with speed and accuracy, and

unlocking new realms of innovation and understanding. The lightning-fast progression of AI, unprecedented in history, necessitates

rapid advancements at a system level, from low-power and edge-AI devices to cloud-based computing, and in the communication

networks that connect them. This need for rapid AI system scaling is driving the innovation frontier in silicon, packaging, architecture,

and software. This paper describes a matrix of technologies that empowers the industry to achieve remarkable progress at every level,

from chips to systems.

1.2 From Chips to Thoughts: Building Physical Intelligence into Robotic Systems

Daniela Rus, Massachusetts Institute of Technology, Director,

CSAIL & Andrew (1956) and Erna Viterbi Professor, Cambridge, MA

In today’s robot revolution, a record 3.1 million robots are now working in factories, doing everything from assembling computers to

packing goods and monitoring air quality and performance. A far greater number of smart machines impact our lives in countless other

ways—improving the precision of surgeons, cleaning our homes, extending our reach to distant worlds—and we are on the cusp of even

more exciting opportunities. Future machines, enabled by recent advances in AI, will come in diverse forms and materials, embodying a

new level of physical intelligence. Physical Intelligence is achieved when the power of AI is to understand text, images, signals, and other

information is used to make physical machines such as robots intelligent. However, a critical challenge remains: balancing the capabilities

of AI with sustainable energy usage. To achieve effective physical intelligence, we need energy-efficient AI systems that can run reliably

on robots, sensors, and other edge devices. In this paper I will discuss the energy challenges of transformer-based foundational AI

models, I will introduce several state space models, and explain how they achieve energy efficiency, and how state-space models enable

physical intelligence.

Plenary Session — Invited Papers

1.3 AI Revolution Driven by Memory Technology Innovation

Jaihyuk Song, Corporate President & CTO, Device Solutions, Samsung Electronics, Hwaseong, South Korea

The recent AI revolution, spearheaded by Large Language Models (LLMs), demands substantial computing resources and corresponding

memory solutions. However, unlike processors that can leverage advancements in fabrication processes, memory devices are

increasingly struggling to meet the high bandwidth, large capacity, and power efficiency requirements of AI systems.

This paper analyzes the requirements and limitations of systems in the AI era, categorizing application-specific memory needs in terms

of performance, power, and capacity. We introduce performance-centric solutions such as HBM (High Bandwidth Memory) and PIM

(Processing-In-Memory) technologies, energy-efficient solutions including custom HBM and LPW (LPDDR Wide-IO) memory, and

capacity-focused solutions like SSD (Solid-State Drives) and CXL (Compute Express Link) Memories. Additionally, we discuss how

continuous scaling of DRAM and NAND Flash processes, as well as 3D-packaging technologies, can address the trade-offs among

performance, power, and capacity more effectively. Finally, the importance of software technologies in optimizing the utilization of these

increasingly specialized memory solutions is emphasized, along with a discussion of the enabling core technologies for each solution.

To meet the high demands of AI systems, the ongoing advancement of existing memory devices and the development of new memory

solutions will play crucial roles. These efforts will support the advancement of AI technologies and contribute to human society.

1.4 The Crucial Role of Semiconductors in the Software-Defined Vehicle

Peter Schiefer, President & CEO, Infineon Technologies, Munich, Germany

The automotive industry is undergoing a significant transformation, driven by the rise of software-defined vehicles (SDVs).

Semiconductors will play a pivotal role in enabling this transition, powering the complex systems that underpin the features and functions

of modern cars.

This paper explores the key trends driving the growth of the automotive semiconductor market, including green mobility, autonomous

driving, and smarter cars. It delves into the challenges and opportunities associated with the development of SDVs, highlighting the

importance of advanced microelectronics, artificial intelligence, and secure communication solutions.

The paper concludes by emphasizing the crucial role of semiconductors in shaping the future of mobility. By addressing the challenges

and embracing the opportunities presented by SDVs and AI, the automotive industry can create a more sustainable and innovative future.

ISSCC 2025

EVENING EVENTS

Evening Events (EE)

ISSCC 2025 will continue the popular tradition of evening sessions where experts often of opposing views, discuss topics

which range from the lighthearted to the controversial (but always informative and entertaining!). This year’s are “Quantum

Computing: Whose Qubit is Better?”, “Future of Analog Design: Still Magical or Mostly Digital?”, and “The Next Decade of AI

– Barriers, Opportunities, & Directions”.

In addition, ISSCC 2025 will include additional special events including “Student Research Preview” and a networking and

mentoring session.

Sunday, February 16

Bingo Networking Event

Women in Circuits (WiC) together with ISSCC will be holding a networking and mentoring session on Sunday afternoon. Distinguished

WiC members, and other participants will play getting-to-know-you bingo to promote engagement between various members of the

community. This will give participants the chance to network and mingle with people across a spectrum of seniority in the field in a casual

setting. This event is open to all ISSCC attendees and the public.

EE1: Student Research Preview

The Student Research Preview (SRP) will highlight selected student research projects in progress. The SRP consists of 24 sixty-second

presentations followed by a Poster Session, by graduate students from around the world, which have been selected on the basis of a

short submission concerning their on-going research. Selection is based on the technical quality and innovation of the work. This year,

the SRP will be presented in three theme sections: 1) Power and Analog, 2) ML, Cryogenic and Ising Machine, and 3) Data

Communications and Frequency Generation.

The SRP will include an inspirational lecture by Professor Jan Rabaey (UC Berkeley). SRP begins at 8:00 pm on Sunday, February

16th. It is open to all ISSCC registrants.

Tuesday, February 18

EE2: Quantum Computing: Whose Qubit is Better?

Quantum computing utilizes principles of quantum mechanics to solve intricate computational problems more efficiently than classical

computers. Various qubit technologies are being developed for this purpose, such as superconducting qubits, trapped ions, and silicon-

spin qubits. Each approach has its supporters and critics with prominent industry players investing heavily in one or more of these

technologies. The race is on to determine which qubit technology will emerge as the dominant choice. Be prepared to participate in a

discussion with our panel of experts as they weigh in on the advantages and challenges associated with each of the innovative

approaches to determine whose qubit will rise to the top.

Tuesday, February 21

EE3: Future of Analog Design: Still Magical or Mostly Digital?

A gradual paradigm shift has occurred over the past four decades for implementing analog functions in IC design from classical analog

circuits to digital-intensive circuits. The emergence of new digital architectures has further prompted explosive growth in digital circuits

providing analog functionality, including digital radios, transmitters (TXs), receivers (RXs), phase-locked loops (PLLs), low-dropout

voltage regulators (LDOs), and time-based analog-to-digital converters (ADCs). The success of these new digital approaches has been

further driven by improvements in area, power, and design time in advanced technology nodes. Classical analog circuits, however, often

provide superior performance as compared to their digital counterparts. In this panel, experts across industry and academia will explore

this digital/analog divide and provide insights into this IC design trade-off, including in the context of emerging technology trends.

EE4: The Next Decade of AI – Barriers, Opportunities, & Directions

The past decade has witnessed dramatic advances in Artificial Intelligence (AI) hardware, software, and models, paving the way for AI

applications that are shaping many facets of our everyday life. As AI ushers us into the next boom in autonomy, productivity, and

creativity, there are critical challenges that lie ahead such as energy consumption, costs, data availability, and government regulations

to name a few. Which of these or other potential barriers will limit AI progress in the next decade? What will be the unintended

consequences of the emergence of AI and what can we do to curb these? Which opportunities may arise from innovating solutions to

overcome these challenges? Our evening panel of experts in AI hardware, software, models, and government policy offer insightful

perspectives on these and other consequential questions that will influence the course of AI in the coming decade.

Session Overviews and Highlights

ISSCC 2025

SESSION OVERVIEWS

AND HIGHLIGHTS

Conditions of Publication

PREAMBLE

The Session Overviews and Highlights to follow serve to capture the context, highlights, and potential impact, of the papers to be

presented in each Session at ISSCC 2025 in February.

OBTAINING COPYRIGHT to ISSCC press material is EASY!

You may quote the Subcommittee Chair as the author of the text if authorship is required.

You are welcome to use this material, copyright- and royalty-free, with the following understanding:

o That you will maintain at least one reference to ISSCC 2025 in the body of your text, ideally retaining the date and

location. For detail, see the FOOTNOTE below.

o That you will provide a courtesy PDF of your excerpted press piece and particulars of its placement to

lcfujino@aol.com and shahriar@ece.ubc.ca

FOOTNOTE

• From ISSCC’s point of view, the phraseology included in the box below captures what we at ISSCC would like your readership

to know about this, the 72nd appearance of ISSCC, on February 16th to February 20th, 2025.

This and other related topics will be discussed at length at ISSCC 2025, the foremost global forum for new

developments in the integrated-circuit industry. ISSCC, the International Solid-State Circuits Conference,

will be held on February 16 - February 20, 2025

ISSCC Press Kit Disclaimer

The material presented here is preliminary.

As of November 5, 2024, there is not enough information to guarantee its correctness.

Thus, it must be used with some caution.

Session 2 Overview: Processors

Digital Architectures and Systems Subcommittee

Session Chair: Nathaniel Pinckney, Nvidia, Austin, TX

Session Co-Chair: Jie Gu, Northwestern University, IL

Processors are the driving workhorses of modern high-performance computing. This session presents the next-generation processors

for both general-purpose computing and specialized computing. The first three papers deliver the next-generation CPU processors and

networking processors, followed by a 20-chiplet heterogenous 2.5D system in the fourth paper. The remaining papers present the state-

of-the-art super-resolution image and video processors, as well as advanced 3D rendering processors.

• In Paper 2.1, AMD presents Zen5, their next-generation x86-64 microprocessor core, targeting both desktop and laptop applications. Built in

TSMC’s 4nm FinFET process, the 55mm2 core complex integrates 8.6B transistors across 8 CPU cores. Each core delivers a generational

~16% IPC improvement over Zen4, while maintaining a maximum frequency of 5.7GHz.

• In Paper 2.2, IBM showcases their next-generation 5.5GHz Telum II microprocessor with an enhanced AI accelerator and a new Data

Processing Unit for I/O offload. The 600mm2 die with 43B transistors in 5nm benefits from +40% L2 cache, on-chip voltage control, +36% X-

BUS bandwidth, and reliability enhancements, while staying within 5% power of the prior generation.

• In Paper 2.3, Intel describes the integration of their Xeon 6 SoC, Granite Rapids-D. In exploring the I/O die built in Intel 4 technology, details

of the 200Gbps Enterprise-Grade Ethernet, 32 lanes of 32Gbps PCIe5, and 16 lanes of PCIe4 are provided. Hardware accelerators

demonstrate 1.8× improvement in system-level benchmarks over prior generations.

• In Paper 2.4, Intel presents a 2.5D package integration approach with workload-dependent configurations. The system features 20 chiplets

from two different foundries assembled over a passive silicon base (22mm×19mm, >1650000mbumps, 120 power domains) with a power of

20.52W at 300MHz and running deep learning inference at a peak of 72TOPS.

• In Paper 2.5, National Tsing Hua University and Taiwan Semiconductor Research Institute present a 16nm CNN processor supporting

bidirectional FPN for small-object detection on high-resolution videos, achieving 5.7TOPS for 26.4% COCO mAPs on 896×896 inputs at

26.6fps with 1373mW at 1.09V.

• In Paper 2.6, Tsinghua University presents a 28nm 2.43mm2 3D Gaussian splatting processor for multi-scenario rendering on edge devices.

The work exploits dynamic reconfiguration through hybrid rasterizing and interpolating to achieve 1.96× higher peak throughput, while

delivering 7.9× lower energy per frame over state-of-the-art 3D rendering accelerators.

• In Paper 2.7, KAIST university presents a 28nm 20.25mm2 3D Gaussian splatting SoC for real-time interactive rendering. The spatial

computing SoC achieves 72fps real-time rendering with 34× faster performance and 300× lower energy than an edge GPU.

• In Paper 2.8, National Taiwan University and Google present an image signal processor for true video super-resolution (VSR) by utilizing

inter-frame dependencies. The 40nm 6.8mm2 chip delivers 210fps for 4K UHD with 3.5× higher frame rate and 7.3× better energy efficiency

than previous works.

• In Paper 2.9, National Tsing Hua University and TSMC present an 16nm 8mm2 space-time resolution enhancement neural-network processor

for next-generation display and streaming, achieving 8K-UHD 60fps inference with 10.2TOPS and 1425mW at 400MHz.

• In Paper 2.10, Fudan University presents a 22nm 6mm2 low-power (0.52mJ/frame), high-throughput (1K@107fps) super-resolution

processor. It uses a channel-number-adaptive caching strategy to reduce on-chip memory by 90%, a workload-balance engine cutting

execution cycles by 64%, and a hybrid data flow improving hardware utilization by 75%.

Session 2 Highlights: Processors

[2.1] “Zen 5”: The AMD High-Performance 4nm x86-64 Microprocessor Core

[2.2] IBM Telum II: Next Generation 5.5GHz Microprocessor with On-Die Data

Processing Unit and Improved AI Accelerator

[2.3] Granite Rapids-D: Intel Xeon 6 SoC for vRAN, Edge, Networking, and Storage

[2.6] A 1.78mJ/Frame 373fps 3D GS Processor Based on Shape-Aware Hybrid

Architecture Using Earlier Computation Skipping and Gaussian Cache Scheduler

[2.8] A 210fps Image Signal Processor for 4K Ultra HD True Video Super-Resolution

Paper 2.1 Authors: Teja Singh1, Sundar Rangarajan1, Shane Southard1, Alex Schaefer1, Sarah Tower1, Spence Oliver1, Kathy

Hoover1, Deepesh John1, Ted Antoniadis1, Shravan Lakshman1, Vibhor Mittal1, Brian Kasprzyk1, Ross McCoy1, Kurt Molhman1, Russ

Schreiber1, Sahilpreet Singh2, Carson Henrion3, Brett Johnson3, Nick Lance3, Darryl Prudich3, Justin Coppin3, Tim Jackson3, Anita

Karegar3, Ryan Miller3, Sabeesh Balagangadharan4, Hon-Hin Wong5, Wilson Li5, Michael McCabe5, James Pistole5, Daryl Lieu5

Paper 2.1 Affiliation: 1AMD, Austin, TX, 2AMD, Markham, ON, Canada, 3AMD, Fort Collins, CO, 4AMD, Bangalore, India, 5AMD, Santa

Clara, CA

Paper 2.2 Authors: Gerald Strevig1, Chris Berry2, Rahul Rao3, Noam Jungmann4, Michael Sperling2, Michael Becht2, Eduard Herkel5,

Matthias Pflanz5, Pat Meaney2, Michael Romain2, Mark Cichanowski1, Amanda Venton1, David Wolpert2, Elazar Kachir4, Luke

Hopkins2, Tim Bubb2, Andreas Arp5, Daniel Kiss5, Simon Büchsenstein5, Michael Wood2, Michael Spear1, Robert Sonnelitter2, Rajiv

Joshi6

Paper 2.2 Affiliation: 1IBM Infrastructure, Austin, TX, 2IBM Infrastructure, Poughkeepsie, NY, 3IBM Infrastructure, Bangalore, India,

4IBM Infrastructure, Tel Aviv, Israel, 5IBM Infrastructure, Böblingen, Germany, 6IBM Research, Yorktown Heights, NY

Paper 2.3 Authors: Raj R Varada1, Rohini Krishnan2, Ajith Subramonia2, Rathish Chandran2, Kalyana Chakravarthy2, Uttpal D Desai2,

Yun Kim1, Puneesh Puri2, David R Mulvihill3, Mike Bichan4, Vijayalakshmi Ramachandran5

Paper 2.3 Affiliation: 1Intel, Santa Clara, CA, 2Intel, Bengaluru, India, 3Intel, Fort Collins, CO, 4Intel, Toronto, ON, Canada, 5Intel,

Chandler, AZ

Paper 2.6 Authors: Xiaoyu Feng, Hedi Wang, Chen Tang, Huazhong Yang, Yongpan Liu

Paper 2.6 Affiliation: Tsinghua University, Beijing, China

Paper 2.8 Authors: Ying-Sheng Lin1, Jun Nishimura2, Chia-Hsiang Yang1

Paper 2.8 Affiliation: 1National Taiwan University, Taipei, Taiwan, 2Google, Mountain View, CA

Subcommittee Chair: Rahul Rao, IBM, Bangalore, India, Digital Architectures and Systems

CONTEXT AND STATE OF THE ART

● Slowing generational process scaling is bolstered by architectural improvements and AI accelerator integration, bringing

respectable IPC uplifts and sizable AI workload improvements.

● Lossy video encodings have been supercharged by machine learning to save on bandwidth and energy, while custom SoCs

focus on edge rendering.

● Multi-foundry 2.5D packaging integration allow for assembly-time configurations to better meet application demands.

TECHNICAL HIGHLIGHTS

● AMD presents Zen 5, their next-generation x86-64 microprocessor core.

o Microarchitectural improvements combined with L1/L3 cache size increases deliver a ~16% IPC increase in the

5.7GHz CPU core, while technology scaling combines with physical optimizations to achieve a 30% increase in

transistor density vs. Zen4.

● IBM demonstrates their 5nm Telum II microprocessor with an enhanced AI accelerator and new IO engine.

o Support for INT8 AI acceleration increases system compute performance to 768TOPs with 30 higher AI

TOPs/thread, while overall IPC per Watt increases by 1.3.

● Intel describes the integration of their Xeon 6 SoC, Granite Rapids-D.

o Designed to support both server-grade and edge applications, the SoC showcases a 200Gbps enterprise-class

Ethernet, 48 PCIe lanes with a total bandwidth of 189GB/s, and hardware accelerators delivering a 1.8 performance

lift.

● Tsinghua University demonstrates a Gaussian splatting processor for multi-scenario rendering aimed at next-

generation 3D rendering applications on power-constrained edge-devices.

o The processor achieves 1.96 higher peak throughput, 3.18 higher area-normalized throughput and 7.9 lower

energy per frame over SOTA 3D rendering accelerators, through a combination of dynamic reconfiguration and

cache-based spatio-temporal scheduling.

● National Taiwan University and Google present an image signal processor for true video super-resolution (VSR),

utilizing inter-frame dependency in encoded videos.

o The chip achieves a maximum frame rate of 210fps for 4K UHD, with 3.5 higher frame rate and 7.3 greater energy

efficiency than prior multi-image super-resolution designs, consuming 344 mW at 200MHz from a 0.98V supply.

APPLICATIONS AND ECONOMIC IMPACT

● Disaggregation of traditionally monolithic die into chiplet-based systems continues with increasing diversity into a wider range

of applications.

● Super-resolution video accelerators driven by machine learning techniques are rapidly increasing in frames per second and

resolution under tight edge energy budgets, while saving significantly on bandwidth.

● Real-time 3D edge processing unlocks potential in virtual reality and embodied artificial intelligence applications.

Session 3 Overview: Amplifiers and Analog Front-Ends

Analog Subcommittee

Session Chair: Jiawei Xu, Fudan University, Shanghai, China

Session Co-Chair: Ippei Akita, AIST, Tsukuba, Japan

Amplifiers and analog front-ends continue to achieve high precision without sacrificing power efficiency and stability. The first presentation

describes an audio amplifier utilizing a single-inductor buck-boost power stage to boost output power and achieve a high dynamic range

with low-voltage input devices. The following two presentations focus on improving the bandwidth-power efficiency and temperature

stability of precision amplifiers, respectively. The final paper showcases an on-chip heater-based self-trimming technique to enable easy

and fast post-package trim.

• In Paper 3.1, National Cheng Kung University presents a double-sided voltage-boosting (DSVB) modulation scheme that integrates a capacitively-

coupled chopper amplifier with a single-inductor buck-boost power stage achieving 121.3dB DR, 0.0019% THD+N, 87.2% peak efficiency, and

maximum POUT of 1.6W with an 8Ω load.

• In Paper 3.2, Fudan University introduces a 36V current-balancing instrumentation amplifier (CBIA) with adaptive biasing for improving power-

efficient input range, and negative capacitance for noise reduction and bandwidth extension, achieving 400MHz/µA GBW/IQ in a standard 0.18µm

BCD process.

• In Paper 3.3, Leibniz University Hannover shows a 3-in-1 multi-mode trimming-free amplifier with a Logarithmic Conformity Error (LCE) of 0.75%

over the full temperature range from -25 to 200°C.

• In Paper 3.4, Tsinghua University and TU Delft present an on-chip heater-based 2-temperature self-trimming technique. The CMOS opamp

achieves an offset of ±5.8μV (3σ) at room temperature and an offset drift of ±88nV/°C (3σ) from-40 to 125°C.

Session 3 Highlights: Amplifiers and Analog Front-Ends

[3.1] A 121.3dB-DR, 115dB-PSNR, Digital-Input Capacitive-Feedback Class-D Audio

Amplifier with Double-Sided Voltage-Boosting (DSVB) Modulation

Paper Authors: Jhuang-Wei Cyue, Tai-Haur Kuo

Paper Affiliation: National Cheng Kung University, Tainan, Taiwan

Subcommittee Chair: Viola Schaffer, Texas Instruments Deutschland GmbH, Freising, Germany, Analog

CONTEXT AND STATE OF THE ART

• In power-efficient Class-D audio amplifiers (CDAs), the maximum output power is limited by the available battery voltage.

• Although a capacitively-coupled chopper-amplifier (CCCA) topology-based CDA eliminates the thermal noise from the

resistive feedback, existing feedback-after-LC architecture requires two pairs of LC filters.

TECHNICAL HIGHLIGHTS

• National Cheng Kung University shows a double-sided voltage-boosting (DSVB) modulation scheme that integrates a

CCCA with a single-inductor buck-boost power stage, allowing to use only a single off-chip inductor (1L) and achieve

additional output power-boost capability.

• The Class-D audio amplifier achieves 121.3dB DR, 115dB PSNR, 0.0019% THD+N, 87.2% peak efficiency, and maximum

POUT of 1.6W with an 8Ω load.

APPLICATIONS AND ECONOMIC IMPACT

• The double-sided voltage-boosting (DSVB) modulation scheme not only increases the output power with high efficiency and

high dynamic range, but also effectively reduces modulation-induced common-mode EMI.

• The single-inductor buck-boost power stage leads to smaller PCB area and lower manufacturing costs.

Session 4 Overview: Analog Techniques

Analog Subcommittee

Session Chair: Jiawei Xu, Fudan University, Shanghai, China

Session Co-Chair: Drew Hall, University of California, San Diego, CA

Analog techniques to improve circuit performance, power efficiency, and cost-effectiveness are presented. The first paper introduces a

high-speed ADC front-end, followed by the second paper showing a fully integrated frequency management module. The next two papers

describe new analog conduction angle control techniques for ultra-low-power crystal oscillators, and a highly power-efficient RC oscillator

with an improved temperature coefficient follows. The last paper presents a CMOS-based voltage reference in a 3nm FinFET technology.

• In Paper 4.1, the University of Twente presents a 12.8GS/s 4× time-interleaved track-and-hold front-end. Fabricated in 22nm FDSOI, this ADC

front-end achieves 40.9dB SNDR and 49.6dB SFDR at 30.2GHz input frequency for RF sampling applications without the need for skew

calibration.

• In Paper 4.2, Shanghai Jiao Tong University introduces a 1.8-to-3.0GHz fully integrated frequency management module that employs a direct

frequency synthesis structure, achieving –47/+42ppm inaccuracy from –40 to 95°C and –150/+70ppm after accelerated aging while supporting a

fast frequency hopping with a 7μs settling time.

• In Paper 4.3, Hong Kong University of Science and Technology presents a crystal oscillator that controls its conduction angle by amplitude for

adaptive Class-C operation. This 32.768kHz oscillator consumes 0.36nW power and occupies only 820μm2 of area, the smallest among the state-

of-the-art crystal oscillators.

• In Paper 4.4, the Technical University Munich shows an ultra-low-power 32.768kHz oscillator with an analog amplitude regulation for sub-nano

power consumption of 0.36nW at 0.9V. The 0.047mm2 design in 0.18um CMOS technology enables a large supply voltage range from 0.9 to 2V

with process tracking.

• In Paper 4.5, the University of Electronic Science and Technology of China presents an open-loop reference-replication-based RC oscillator. The

prototyped 10MHz RC oscillator in 65nm CMOS achieves a power efficiency of 0.4μW/MHz, a temperature coefficient of 9.83 ppm/°C from –40

to 125°C, and a die area of 0.0085mm2.

• In Paper 4.6, TSMC presents a single-Vth CMOS-based voltage reference in a 3nm FinFET technology. The circuit provides a ripple-less reference

voltage with a line regulation of 0.04%/V, a power-noise rejection of <–40dB from DC to 1GHz, an untrimmed peak inaccuracy of 2.1%, and a

temperature coefficient of 31ppm/°C at a 0.8V supply while occupying a 294μm2 area.

Session 4 Highlights: Analog Techniques

[4.3] A 0.36nW, 820μm2, 32kHz Conduction-Angle-Adaptive Crystal Oscillator in 28nm

CMOS for Real-Time Clock Applications

Paper Authors: Peng Wang1, Manyu Wang1, Guangnan Dai1, Yujia Cao1,2, Sining Pan2, Yihan Zhang1

Paper Affiliation: 1Hong Kong University of Science and Technology, Hong Kong, Hong Kong, 2Tsinghua University, Beijing, China

Subcommittee Chair: Viola Schaffer, Texas Instruments Deutschland GmbH, Freising, Germany, Analog

CONTEXT AND STATE OF THE ART

● In duty-cycled sensor nodes, the ultra-low-power 32.768kHz crystal oscillator with good frequency stability plays a critical role

as an always-on real-time clock.

● Although pulse-injection crystal oscillators are popular due to their sub-nW power consumption, they require accurate injection

timing control, leading to complexity and area overhead. They also need separate start-up circuits to initiate the oscillation.

TECHNICAL HIGHLIGHTS

● Hong Kong University of Science and Technology and Tsinghua University present a conduction-angle-adaptive

crystal oscillator that controls the conduction angle by amplitude for adaptive Class-C operation, allowing a larger

conduction angle to avoid the overhead caused by the need for generating accurate delays.

o The crystal oscillator consumes a record-low 0.36nW power and occupies only 820μm2 of area, the smallest among

the state-of-the-art crystal oscillators.

o The crystal oscillator also has an intrinsic start-up capability, making it easy to use.

APPLICATIONS AND ECONOMIC IMPACT

● The ultra-low-power real-time clock can reduce the overall power consumption of the duty-cycled sensor node system.

● Small area lowers manufacturing cost, and intrinsic start-up capability makes it readily applicable to the system.

Session 5 Overview:

Front-End Circuits for High-Performance Transceivers

RF Subcommittee

Session Chair: Henrik Sjöland, Ericsson Research and Lund University, Lund, Sweden

Session Co-Chair: Hongtao Xu, Fudan University, Shanghai, China

Front-end circuits continue to evolve, demonstrating higher power efficiency, output power, and supported data-rates, along with increased

robustness to antenna impedance variations and higher integration levels. The first paper in this session presents a GaN PA operating in a

switchless Class-G Doherty continuum, achieving high efficiency over a wide bandwidth. This is followed by four papers showcasing the latest

developments in PA circuits. The next 4 papers focus on digital transmitters that deliver high output power, efficiency, and linearity. The final two

papers are on LNAs, with the last one demonstrating an LNA with improved linearity using a Doherty architecture.

• In Paper 5.1, Tsinghua University presents a GaN PA with high back-off efficiency over a wide frequency range operating in a SLCG-Doherty

continuum. It achieves over 38% efficiency at 6dB back-off from 1.35 to 7.6GHz.

• In Paper 5.2, Shanghai Jiao Tong University introduces a digital PA for beamforming transmitters with enhanced back-off efficiency. This design

in a 0.18µm SiGe technology achieves a peak 6dB back-off efficiency of 16% at 24GHz.

• In Paper 5.3, the University of Electronic Science and Technology of China shows a mm-wave PA with radial power combining. The 56-to-64GHz

PA in a 40nm bulk CMOS technology provides 30.2dBm PSAT with 23.5% PAE.

• In Paper 5.4, Samsung Electronics presents a 5G FR2 Doherty PA using 3-stack amplifier stages with an operation amplifier bias scheme. From

24 to 29GHz the 22nm FDSOI chip delivers a 5G NR 64-QAM OFDM 100MHz signal with Pavg>16.5dBm at an EVM of -25dB.

• In Paper 5.5, ETH Zürich demonstrates a compact and wideband load insensitive PA for large-scale phased arrays. The 22nm SOI design

operates from 34 to 40.8GHz, has a core area of 0.093mm2, and an OP1dB of 17.3dBm with a PAE of 37%.

• In Paper 5.6, the Institute of Science Tokyo presents a CORDIC-less digital polar transmitter using only 1b DSM-based PA and 3b PM signal

controls. It can support 50MS/s 256-QAM while achieving a PAE over 20% at 1.35GHz.

• In Paper 5.7, the University of Electronic Science and Technology of China introduces a polar SCPA using slope-to-phase self-calibration, which

achieves 27.7dBm peak Pout and 37.8% peak PAE at 4.7GHz. It can support 160MHz 64-QAM and 80MHz 256-QAM signals without a DPD.

• In Paper 5.8, the Delft University of Technology demonstrates a 20W CMOS/LDMOS digital transmitter with dynamic retiming and a glitch-free

phase mapper, achieving 68%/62% Peak Drain/System Efficiency at 1.7GHz.

• In Paper 5.9, Xidian University shows a 21-to-31GHz DPD-less quadrature RFDAC with invariant impedance and scalable LO leakage. It can

support 2.4Gb/s 256-QAM without a DPD and high-order modulation up to 4096-QAM.

• In Paper 5.10, the University of California, San Diego presents a mm-wave low-noise active bandpass filter employing an all-passive interferer

cancellation feedforward path. It consumes 3.5mW from 22 to 27GHz and achieves a noise figure of 3.8dB and an EVM<0.84% for up to 1024-

QAM 800MHz-bandwidth signals.

• In Paper 5.11, the University of Texas, Austin demonstrates a blocker-tolerant mm-wave low-noise amplifier utilizing Doherty load modulation. It

has 2.5dB NF at 27.2GHz and supports 400MHz 64-QAM with a -13.7dBm in-band blocker.

Session 5 Highlights:

Front-End Circuits for High-Performance Transceivers

[5.1] A GaN SLCG-Doherty-Continuum Power Amplifier Achieving >38% 6dB Back-Off

Efficiency over 1.35 to 7.6GHz

[5.6] A Power-Efficient CORDIC-less Digital Polar Transmitter Using 1b DSM-Based PA

Supporting 256-QAM

Paper 5.1 Authors: Guansheng Lv, Wenhua Chen, Xiaofan Chen, Zhenghe Feng

Paper 5.1 Affiliation: Tsinghua University, Beijing, China

Paper 5.6 Authors: Yuncheng Zhang, Zezheng Liu, Duo Li, Minzhe Tang, Yi Zhang, Hongye Huang, Dingxin Xu, Waleed Madany,

Ashbir Aviat Fadila, Wenqian Wang, Yuang Xiong, Daxu Zhang, Garry Pranata Kusuma, Hiroyuki Sakai, Kazuaki Kunihiro, Atsushi

Shirane, Kenichi Okada

Paper 5.6 Affiliation: Tokyo Institute of Technology, Tokyo, Japan

Subcommittee Chair: Brian Ginsburg, Texas Instruments, Allen,TX, RF subcommittee

CONTEXT AND STATE OF THE ART

● Front-end circuits with high power efficiency, providing high signal quality, and low noise and distortion are essential to

applications like 5G communications.

● Transceivers front-ends with wide bandwidths and high integration levels are key to reducing size and cost while supporting

high peak data-rates and operation across multiple frequency bands.

● Innovations on circuit and architecture levels, combined with advancements in semiconductor technologies, continue to

enhance front-end performance and functionality.

TECHNICAL HIGHLIGHTS

● Tsinghua University introduces a GaN PA achieving high back-off efficiency over a wide frequency band by

introducing operation in a switch-less Class-G-Doherty continuum.

o More than 38% 6dB back-off drain efficiency over a 1.35 to 7.6GHz frequency band is obtained. For a 100MHz LTE

signal with a 7.5dB PAPR, the average power is 29.1 to 31dBm with a drain efficiency from 35 to 49.2%.

● Tokyo Institute of Technology presents a CORDIC-less digital polar transmitter with good linearity and power

efficiency.

o The digital polar transmitter uses only a 1b DSM-based PA and 3b PM signal controls. It can support 50MS/s 256-

QAM and exhibits a PAE over 20% at 1.35GHz

APPLICATIONS AND ECONOMIC IMPACT

● Enhanced front-end performance improves the capacity of wireless communication and enables new wireless applications.

● Improved power efficiency extends battery life in mobile devices and reduces electricity costs for infrastructure.

Session 6 Overview: Imagers and Displays

Imagers, Medical, and Displays Subcommittee

Session Chair: Leonardo Gasparini, FBK, Trento, Italy

Session Co-Chair: Hyung-Min Lee, Korea University, Seoul, Korea

Image sensor technology continues to evolve in two main directions: (1) high-resolution, high-dynamic-range, high-speed imaging using 3D integrated technology, and

(2) low-power, background-resilient depth sensing using SPADs. The first paper introduces a high-resolution imager with hybrid shutter operation. This is followed by

two SPAD-based time-of-flight sensors, excelling in low power consumption, and immunity to strong background light, respectively. Finally, three imagers with pixel-

parallel ADC architectures are presented, offering high-quality global-shutter solutions for consumer cameras, AR/VR systems, and LW-IR imaging.

Display technology continues to advance in two main directions: (1) source-driver ICs with uniform, high color depth, area- and power-efficient design, and (2) touch

analog front-end ICs with high SNR and frame rate. The first three papers introduce different source-driver ICs: the first is designed for OLED-on-Silicon (OLEDOS)

displays with uniform outputs, the second features an OLED driver IC with a real-time pixel compensator, and the third focuses on a compact OLED driver IC utilizing

a delta-sigma modulation technique. The final paper presents an EMI-reduced touch analog front-end IC for CISPR 25 class 5 for automotive displays.

• In Paper 6.1, Samsung Electronics describes a 3-layer stacked CMOS Image Sensor providing switchable operation between a 50 Mpixel rolling shutter and a

12.5 Mpixel global shutter. The sensor provides a triple conversion gain, a FWC of 52 ke- and a RN as low as 2.4 e- in global shutter mode.

• In Paper 6.2, a joint development led by Ulsan National Institute of Science and Technology presents a 160×90 Flash LiDAR with variable frame rate, between 5

and 250fps, based on the available pixel SNR, through a combination of quaternary-search and indirect time-of-flight. The LiDAR system achieves 6cm precision

and 8cm accuracy in the range 1.5 to 22.5m, sustaining up to 30klux background light.

• In Paper 6.3, a joint development led by Yonsei University describes a 64×64-pixel, SPAD-based indirect time-of-flight sensor with 120klux background light

rejection capabilities thanks to the integration of a chopper in the in-pixel analog counter. The sensor covers a range of 76m with a precision of 6.4cm, and it is

able to acquire depth images at 50fps at 39.4mW power consumption.

• In Paper 6.4, Brillnics and Meta introduce a 3-layer stacked, 400×400-pixel CMOS Image Sensor with per-pixel ADC. By incorporating advanced processing

stages, including a frame averaging unit, an image processing unit, and a sparse readout mechanism, this Digital Pixel Sensor achieves a single-exposure DR of

117dB while consuming 3.06mW at 30fps, and FoM of 0.0049 e-rms∙pJ.

• In Paper 6.5, Sony Semiconductor Solutions reveals a 2-layer stacked, 25.2 Mpixel global shutter CMOS Image Sensor with pixel-parallel 14b ADC. The sensor

can acquire images at 120fps with a random noise of 2.66 e-rms at 0dB gain, achieving a record FoM of 0.083 e-rms∙nJ/step, previously achieved only by rolling-

shutter architectures.

• In Paper 6.6, Peking University presents a 320×256, 120.4dB Long-Wavelength Infrared Focal Plane Array. Leveraging on a pixel-parallel, low-power, 20b ADC

architecture based on a light-current-controlled oscillator, the sensor acquires images at 60fps with 2.2mK NETD, while consuming 6.4mW of power.

• In Paper 6.7, Hanyang University describes a 65nm CMOS source-driver IC designed for 6285-PPI OLED-on-Silicon displays, supporting 4K×4K resolution at a

120Hz frame rate. With all-channel automatic offset calibration and data-difference-dependent dynamic current source, the driver achieves a settling time of

0.69μs, DVO of 1.9mV, and silicon area per channel of 2273μm².

• In Paper 6.8, Korea University presents a 250nm CMOS source-driver IC for OLED with a real-time pixel compensator to minimize pixel variation effects. While

the driver supports the 4T1C pixels, the pixel compensator achieves a silicon area of 4442.8μm² and power consumption of 13.3μW per channel, utilizing 6

channels per single-pixel compensator.

• In Paper 6.9, a joint development led by Korea University presents an area efficient 10b source-driver IC with delta-sigma modulation interpolation DAC using

65nm CMOS process. The delta-sigma modulation DAC consists of low-voltage transistors and supports high linearity, achieving the smallest silicon area of

1884μm² and static current of 1.5μA per channel.

• In Paper 6.10, a joint development led by Sungkyunkwan University introduces an 80nm CMOS electromagnetic interference (EMI) tolerant automotive touch

analog front-end IC for CISPR 25 class 5 compliance with low-frequency driving under 150kHz. The 2.21mm² silicon area chip achieves an SNR of 49.6dB, frame

rate of 200Hz, and a power consumption of 10.5mW for the 8.0-inch 28×18 touch screen panel.

Session 6 Highlights: Imagers and Displays

[6.1] A 3-Stacked Hybrid-Shutter CMOS Image Sensor with Switchable 1.2µm-Pitch

50Mpixel Rolling Shutter and 2.4µm-Pitch 12.5Mpixel Global Shutter Modes for Mobile

Applications

[6.5] A 25.2Mpixel 120frames/s Full-Frame Global-Shutter CMOS Image Sensor with

Pixel-Parallel ADC

[6.8] A Real-Time Pixel-Compensated Source-Driver IC with Dual-Slope Error Detection

and Multi-Channel Time-Multiplexing Compensator for Compact OLED Displays

Paper 6.1 Authors: Heesung Shim, Seung-Sik Kim, Min-Woong Seo, Sangsu Park, Hyukbin Kwon, Yongjun Kim, Sanggwon

Lee, Sungbong Park, Daehee Bae, Si Gyoung Koo, Masamichi Ito, Jae-hoon Jeon, Sol Yoon, Sung-Jae Byun, Sangyoon Kim,

KwanSik Kim, Gihwan Cho, Joonho Lee, Tekyou Kim, Sungjae Jun, Jae-kyu Lee, Chang-Rok Moon, Jaihyuk Song

Paper 6.1 Affiliation: Samsung Electronics, Hwaseong, Korea

Paper 6.5 Authors: Toshiki Kainuma, Ryo Wakamatsu, Kimitaka Wada, Tohru Takeda, Shota Ueyama, Hiroki Suto, Tsukasa

Miura, Koushi Uemura, Masao Kimura, Masaki Sakakibara, Yusuke Oike

Paper 6.5 Affiliation: Sony Semiconductor Solutions, Atsugi, Japan

Paper 6.8 Authors: Jaewoong Ahn, Seung Hun Choi, Jun Yeol An, Hyung-Min Lee

Paper 6.8 Affiliation: Korea University, Seoul, Korea

Subcommittee Chair: Rikky Muller, University of California, Berkeley, CA

CONTEXT AND STATE OF THE ART

● 3-layer stacked image sensor technology with high-capacity DRAM can switch from global to rolling shutter modes and

combines the strengths of both architectures to achieve a large pixel array with a high dynamic range.

● Pixel-parallel ADC enables high-speed imaging at high resolution in full-frame global shutter image sensors for full-fledged

consumer cameras.

● Compact and low-power source drivers with high tolerance to process variations of TFT in each pixel are required in many

emerging applications such as high-resolution smartphone displays.

TECHNICAL HIGHLIGHTS

● Samsung Electronics presents a 3-layer stacked hybrid shutter CMOS image sensor capable of switching between

global shutter and rolling shutter modes. The design combines the advantages of both architectures, offering a 50MP

rolling shutter with 1.2μm pixel pitch and a 12.5MP global shutter with 2.4μm pixel pitch.

o Featuring a high-capacity DRAM technology, this hybrid approach ensures a high dynamic range, excellent full-well

capacity of 52ke-, and low random noise 2.4e-, in global shutter mode.

● Sony Semiconductor Solutions introduces a 90nm/40nm 25.2Mpixel global shutter image sensor for full-fledged

consumer cameras capable of acquiring 14b 25.2Mpixel images at 120fps with 75.5dB dynamic range.

o A pixel-parallel ADC architecture converts pixel values with 14b accuracy and a random

noise of 2.66 e-rms at 0dB gain at high speed. This results in a record FoM of 0.083 e-

rms∙nJ/step.

● Korea University discloses a source-driver IC with real-time pixel compensation for compact OLED displays in 250nm

CMOS suppressing the effect of pixel variations.

o A compensator with noise-reduction current conveyor and dual-slope error detector reduces the pixel variation effect

while achieving the smallest silicon area of 4442.8μm² and the lowest power consumption of 13.3μW per channel.

APPLICATIONS AND ECONOMIC IMPACT

● Leveraging on 3D integrated technology, CMOS Image Sensors can acquire high-resolution, large-bit-depth images at high

frame rates and low power without compromises on image quality, opening new opportunities in consumer applications,

including mobile, AR/VR, and full-fledged cameras.

● High-quality displays with compact, low-power, pixel-compensated source drivers are suitable for mobile devices.

Session 7 Overview: Ultra-High-Speed Wireline

Wireline Subcommittee

Session Chair: Ben (Hyo Gyuem) Rhew, Samsung Electronics, Hwaseong, Korea

Session Co-Chair: Jay Im, AMD, San Jose, USA

Subcommittee Chair: Thomas Toifl, Cisco Systems, Thalwil, Switzerland

The advent of AI (artificial intelligence) and HPC (high performance computing) stretches the data communication bandwidth limit between chips,

modules, and data centers, leveraging ultra-high-speed wireline transceivers. The papers in this session describe transmitter and receiver architectures

that enable ultra-high-speed operation while consuming low power. The first two papers (7.1 and 7.2) present 212.5Gb/s DSP-based PAM-4 transceivers

compensating beyond long-reach channels (> 40dB channel loss). The third paper proposes an extra-short-reach 212.5Gb/s receiver employing a slice-

based CTLE and phase-interpolator-based clock generator achieving an energy efficiency of 1.11pJ/b. The fourth paper describes a 112 Gb/s DSP-

based PAM-4 receiver with an LC resonator-based CTLE compensating up to 52dB channel loss. The fifth paper demonstrates a 112Gb/s discrete

multitone receiver with a time-based ADC consuming 353mW. The sixth paper proposes a 106.25Gb/s PAM-4 receiver achieving an energy efficiency of

2.06pJ/b assisted by a 3-tap FFE and a 1-tap speculative DFE. Paper 7.7 and 7.9 present advancement in 50 and 60Gb/s NRZ burst-mode CDR solutions,

respectively, for passive optical network (PON) applications. Paper 7.8 describes a reference-less CDR exploiting a SAR-based frequency acquisition

technique to achieve 63.64Gb/s/μs acquisition speed. Paper 7.10, the last paper of this session demonstrates an 8-phase clock generator, achieving a

wide range of clock speeds from 8 to 28GHz with the help of a dual-feedback ring oscillator.

• In Paper 7.1, MediaTek presents a 4nm 212.5Gb/s DSP-based PAM-4 transceiver, in which each lane incorporates TX/RX PLLs. Designed in

4nm FinFET technology, the transceiver achieves an MLSD BER of 2.5e-6 and 1e-8 over a channel with 50.5dB and 46dB loss, respectively. The

RLM and SNDR of the transmitter are measured at 98.5 and 35.5dB, respectively. The receiver achieves an SNDR of 35dB at a 1GHz sinusoidal

input frequency. The transceiver consumes 5.3pJ/b per lane at a 212.5Gb/s data rate with 1.8pJ/b from DSP.

• In Paper 7.2, Marvell presents a 4-lane 212.5Gb/s PAM-4 transceiver, implemented in 5nm FinFET technology. The transceiver achieves a BER

of 6e-9 on a 46dB channel and an analog energy efficiency of 2.2pJ/b. The current mode transmitter shows an SNDR of 36.1dB and an RMS

jitter of 73fs at a 212.5Gb/s data rate.

• In Paper 7.3, Peking University presents a 224Gb/s XSR receiver in 12nm CMOS technology. The receiver features a slice-based CTLE with

optimized group delay, achieving a -3dB bandwidth of 59GHz and a maximum step size of 1.9dB. The PI-based clock generator with pre-distortion

enables an 8-way interleaved architecture. The receiver demonstrates a BER of 8.1e-7 at 224Gb/s over a 13.6dB-loss channel with only a 3-tap

TX FFE and an efficiency of 1.11pJ/b.

• In Paper 7.4, MediaTek describes a 112Gb/s DSP-based PAM-4 receiver implemented in 4nm FinFET technology. An LC-resonator-based CTLE

is proposed to compensate for > 52dB channel loss along with an analog data path latency reduction technique. A single lane receiver with CDR

consumes 210mW, and the analog transceiver occupies 0.367mm2.

• In Paper 7.5, Daegu Gyeongbuk Institute of Science and Technology demonstrates a 5nm ADC-based discrete multitone (DMT) wireline receiver

featuring a time-interleaved time-based ADC and a frequency domain DSP that equalizes DMT symbols. The receiver achieves a BER of 1e-4 at

112Gb/s over a 12dB-loss channel, consuming 353mW.

• In Paper 7.6, National Tsing Hua University presents a 106.25Gb/s PAM-4 receiver with 3-tap FFE and 1-tap speculative DFE in 28nm technology.

The adaptive FFE generates 1+0.5D and the speculative PAM-4 DFE resolves 1+0.5D response with a time-interleaved 3b+1b ADC. The receiver

can compensate for 21.2dB loss at a power efficiency of 2.06pJ/b.

• In Paper 7.7, Peking University describes an analog 50Gb/s burst-mode NRZ receiver in 28nm CMOS technology for symmetric 50G-PON. 5-tap

FFE and 7-tap DFE including 4 floating taps are implemented to achieve a sensitivity of -24dBm while consuming smaller power than the DSP-

based works. Edge equalization in this work realizes a max lock time of 15ns.

• In Paper 7.8, Xidian University proposes a reference-less CDR using a SAR-based frequency acquisition (FA) technique. The CDR achieves a

constant band switching time of 55ns utilizing the frequency error polarity information. With the help of the proposed charge pump, the total FA

time of <150ns is realized.

• In Paper 7.9, Xi’an JiaoTong University presents a 60Gb/s NRZ burst-mode CDR in 28nm technology. A cross-injection locking technique

suppresses the data-induced side-band noise of the recovered clock, with measured RJ of 140fs and TJ of 3.2ps at 30GHz. A flash phase detector

is introduced to extract the phase information within 2UI, achieving a total reconfiguration time of 0.13ns.

• In Paper 7.10, Fudan University demonstrates an 8-phase 8-to-28GHz clock generator, implemented in 28nm technology. The dual-feedback ring

oscillator proposed in this work decouples the frequency setting loop and the phase locking loop, thus expanding the maximum operation

frequency. The measured output jitter and the maximum phase error are below 40fs and 3° across 8-28GHz, respectively.

Session 7 Highlights: Ultra-High-Speed Wireline

[7.1] A 212.5Gb/s DSP-Based PAM-4 Transceiver with 50dB Loss Compensation for

Large AI Systems Interconnects in 4nm FinFET

Paper Authors: Henry Park*1, E-Hung Chen*2, Mohammed Abdullatif*1, Miguel Gandara1, Ahmed ElShater1, Amr Khashaba1, Shih-

Hao Huang2, Tsz-Bin Liu2, Atharav Atharav1, Joonyeong Lee1, Qaiser Nehal1, Mohamed Megahed1, Yusang Chun1, Cheng-En Shieh2,

Vidhan Jolly1, SoonWon Kwon1, Hsin-Ta Chien2, Ke-Chung Wu2, Cheng-En Liu2, Peng Yan1, Po-Jui Li2, Chun-Han Chen2, Tzu-Shun

Lin2, Pei-Chieh Liu2, Tamer Ali1

Paper Affiliation: 1MediaTek, Irvine, CA, 2MediaTek, Hsinchu, Taiwan

[7.2] A 2.2pJ/b 212.5Gb/s PAM-4 Transceiver with > 46dB Reach in 5nm FinFET

Paper Authors: A. Mostafa1, A. Hassan1, A. Hsu2, A.K. Singh3, C.-H. Wu4, C.-R. Yang1, D. Prabakaran3, D. Storaska5, D. Zhou1, D.

Visani1, E. Hsiao1, F. Chu1, F. Khan1, F. Lu1, G. Cui1, G. Wang1, H. Wang1, J. Natonio5, J. Deng1, J. Ding1, J. Guo1, J. Gu1, J. Zang1, L.

Jiang1, K. Chang1, K.-M. Lu4, M.Hasan1, M. Kelly6, M. H. Kashani2, M. Gambhir1, M. R. Patoju3, M. Singh1, M. Shannon5, M. Yang1, P.

Liu1, P. Ramakrishna3, R. Chen4, R. Ho7, S. N. Shahi8, S. Sivakumar1, S. Xu7, X. Yang1, X. Han1, Y.-P. Su4, Z. Guo1, Z. Li1, Z. Yu1, Z.

Yan1

Paper Affiliation: 1Marvell, Santa Clara, CA, 2Marvell, Toronto, Canada, 3Marvell, Bangalore, India, 4Marvell, Zhubei, Taiwan, 5Marvell,

Fishkill, NY, 6Marvell, Massachusetts, MA, 7Marvell, Burlington, VT, 8Marvell, Kanata, Canada

Subcommittee Chair: Thomas Toifl, Cisco Systems, Thalwil, Switzerland

CONTEXT AND STATE OF THE ART

• The advent of AI and high-performance computing (HPC) is pushing the bandwidth limits of data communication between

chips, modules, and data centers, relying on ultra-high-speed wireline transceivers.

• Innovative designs are crucial for achieving area and power efficiency in a 200Gb/s link while maintaining a low BER over

channels with losses exceeding 40dB.

TECHNICAL HIGHLIGHTS

• Mediatek presents a complete DSP-based PAM-4 transceiver at 212.5Gb/s in 4nm FinFET technology to compensate

channel loss over 46dB.

o The 4nm DSP-based PAM-4 transceiver incorporates TX/RX and PLLs in each lane.

o The RLM and SNDR of the transmitter and receiver are measured at 98.5% and 35.5dB respectively.

o The transceiver achieves an MLSD BER of 2.5e-6 and 1e-8 over a channel with 50.5dB and 46dB loss, respectively.

o The transceiver consumes 1127mW (5.3pJ/b) per lane at 212.5Gb/s data rate.

• Marvell demonstrates a 4-lane 212.5Gb/s PAM-4 transceiver in 5nm FinFET technology to compensate channel loss of

46dB.

o The Serdes transceiver is configured as a 4-lane module.

o The current mode transmitter shows a SNDR of 36.1dB and an RMS jitter of 73fs at a 212.5Gb/s data rate.

o The RX equalized eye after compensating for a loss of 46dB and achieving a PRE-FEC BER of 6e-9.

o The transceiver achieves an analog energy efficiency of 2.2pJ/b at a 212Gb/s data rate.

APPLICATIONS AND ECONOMIC IMPACT

• Long-reach low-power 224Gb/s PAM-4 SerDes transceivers are key components to enable 1.6Tb/s Ethernet for emergent AI

applications.

• Burst-mode NRZ receivers incorporating an equalizer mitigate the bandwidth limitations of optical devices in the next

generation passive optical access networks.

• The presented circuits demonstrate application potentials in standards such as PCIe 7.0, 50G-PON, IEEE802.3dj and OIF

CEI-224G.

Session 8 Overview: Digital Techniques for System

Adaptation, Power Management and Clocking

Digital Circuits Subcommittee

Session Chair: Heein Yoon, UNIST, Ulsan, South Korea

Session Co-Chair: Benton Calhoun, University of Virginia, Charlottesville, VA

Digital architectures continue to improve energy efficiency through closed-loop hardware-software integration, on-die sensors, and real-

time adaptation. The first four papers in this session exemplify power management techniques that improve computing and energy

efficiency and enable efficient Dynamic Voltage and Frequency Scaling (DVFS) in computing systems such as CPU and SoCs. In the

second half of this session, papers explore advanced clocking and power management circuits, featuring a command-aware hybrid LDO

for HBM, a low-spur low-jitter all-digital output frequency divider, two sensors for monitoring thermal profiles or supply and temperature

simultaneously, and an on-cell battery monitoring system.

• In Paper 8.1, IBM presents dynamic guard-band features for the 5nm 5.5GHz zNext computing system. A combination of on-die sensors, a run-

time control loop, and reliability, serviceability and availability features demonstrate savings of 18% total chip power corresponding to 10% system

power savings enabling higher frequency, higher performance, multiple system configurations and new AI features.

• In Paper 8.2 MediaTek researchers present a run-time power management system featuring an on-die power sensor in a 3nm CPU, achieving

94.83% CPU power measurement accuracy with calibration by a silicon-based ML model. On-die current readouts at 10ms sampling rate limits

over-current excursion for power stability in smartphone applications.

• In Paper 8.3, Georgia Institute of Technology presents a dynamically reconfigurable voltage regulator fabric for energy-efficient DVFS in SoCs in

a 65nm CMOS process. The test chip demonstrates a peak power reduction compared to conventional static VR architectures by up to 45% over

single-buck regulation and 25% for two-buck regulation.

• In Paper 8.4, Samsung Electronics presents a voltage droop detector with a 4GHz frequency and sub-2-cycle detection latency in 2nm GAAFET

process. Experimental results demonstrate 0.69% accuracy and effective droop detection with 250ps response time, which is a 8× improvement

for over previous droop detectors, while reducing area overhead by 15-43%.

• In Paper 8.5, Seoul National University presents a command-aware hybrid LDO in a 40nm CMOS process to reduce voltage droops caused by

quarter-rate data strobe buffers in HBM by proactively leveraging the known current profile of memory accesses. The measured voltage droop is

less than 10mV, while consuming only 150mA of quiescent current with 20pF output capacitor.

• In Paper 8.6, the National University of Defense Technology presents an all-digital fractional output frequency divider in a 28nm CMOS process

integrating background calibration for both gain and INL using a split-DTC technique. The proposed design achieves spurious tones levels below

-77dBc and jitter smaller than 257fs with a chip area of 0.024mm².

• In Paper 8.7, Georgia Institute of Technology and Intel present a coupled voltage-temperature sensor employing sensor fusion in 65nm CMOS.

The proposed architecture employs pairs of low-cost sub-sensors of only 67mm² instantiated into voltage-scaled digital domains and relies on

computational sensor-fusion to achieve 2.4°C and 9mV ±3s inaccuracy.

• In Paper 8.8, Intel presents a 16nm digital thermal sensor based on a current-starved ring oscillator, aiming for fine-grain thermal profiling with

high resiliency in noisy environments. A chiplet with a buck IVR integrates an array of 204 sensors, demonstrating high accuracy of ±0.7°C

dissipating 18mW power and having a 400mm² area footprint per sensor.

• In Paper 8.9, Dukosi presents a single-chip solution in 55nm to monitor individual cells in electric vehicle battery packs using near-field contactless

monitoring to minimize the wiring overhead of up to 216 battery cells to a single antenna. The chip consumes an average of 550mW per cell at

3.7V during networked monitoring for 0.1Hz measurements and communication.

Session 8 Highlights: Digital Techniques for System

Adaptation, Power Management and Clocking

[8.1] Dynamic Guard-Band Features of the IBM zNext System

[8.2] Run-Time Power Management System by On-Die Power Sensor with Silicon AI

Calibration in a 3nm Octa-Core CPU

Paper 8.1 Authors: Tobias Webel1, Phillip J. Restle2, Ramon Bertran2, Alper Buyuktosunoglu2, Sean M. Carey3, Alejandro Cook1, Karl

Anderson3, Michael Romain3, Thomas Strach1, Pradeep Bhadravati Parashurama4, Aishwarya Tadkase4, Rahman Abber Tahir1, Luke

Jenkins3, Kevin Low3, Eberhard Engler1,

Paper 8.1 Affiliation: 1IBM Systems, Boeblingen, Germany, 2IBM Research, Yorktown Heights, NY, 3IBM Systems, Poughkeepsie,

NY, 4IBM Systems, Bangalore, India

Paper 8.2 Authors: Chien-Yu Lu1, Bo-Jr Huang1, Min-Chieh Chen1, Alfred Tsai1, Eric Jia-Wei Fang1, Yuju Cho1, Rex Liu1, Ericbill

Wang1, Hugh Mair2, Shih-An Hwang1

Paper 8.2 Affiliation: 1MediaTek, Hsinchu, Taiwan, 2MediaTek, Austin, TX

Subcommittee Chair: Huichu Liu, Meta, Sunnyvale, CA, Digital Circuits

CONTEXT AND STATE OF THE ART

● Dynamic adaptation and run-time power management employing on-die sensors, firmware and workload management

continues to be critical for realizing robust and energy-efficient advanced-node processors from datacenter to mobile

applications.

● Advances in on-die circuit sensing, including ML-based calibration, enable robust and efficient run-time management of digital

systems.

TECHNICAL HIGHLIGHTS

● IBM introduces dynamic guard-band features for the 5nm 5.5GHz zNext computing system.

o On-die digital droop sensors, multiple control loops, and reliability, availability and serviceability features are

effectively combined to achieve 18% total chip power reduction, resulting in 10% system power savings, enabling

higher performance system configurations and new AI features.

● MediaTek presents a run-time power management system in a 3nm octa-core mobile processor SoC, featuring an on-

die power sensor with calibration by a silicon-based ML model.

o An on-die power sensor implemented in a 3nm CPU core complex measures power draw with 94.8% accuracy. The

sensor, calibrated with a silicon-based ML model, is used to achieve per-core power budgeting and to control over-

current excursion for power stability in smartphone applications.

APPLICATIONS AND ECONOMIC IMPACT

● Dynamic guard banding and advanced run-time power management are critical to enable high performance with high

frequency and reduce total power to enable new features in systems, like AI acceleration. High accuracy on-die sensors

coupled with firmware and workload balancing provide a framework for high availability and reliable systems that reduce

overall life-time operating cost.

Session 9 Overview: Ubiquitous Power Delivery

Power Management Subcommittee

Session Chair: Dongsu Kim, Samsung Electronics, Hwaseong, Korea

Session Co-Chairs: Chen-Yen Ho, MediaTek, Hsinchu, Taiwan

Power delivery circuits are employed in all types of electronic applications, and their technological innovations continue to advance. In

this session, the latest advanced technologies for AC-DC converters, supply modulators for RF power amplifiers, low dropout regulators,

wireless power transfer, and hybrid converters are presented to demonstrate their novel topologies achieving higher efficiency, higher

output power, faster transient response and smaller form factor.

• In Paper 9.1, University of Macau presents a capacitor-drop sigma-floating-SC AD-DC converter for IoT and smart home devices. With a sigma-

SC rectifier and a floating-SC converter, it achieves 81.3% peak efficiency and maintain high efficiency for wide load current range.

• In Paper 9.2, University of California, Berkeley presents a fully integrated symbol-power-tracking (SPT) CMOS RF power amplifier with a non-

uniform multi-level supply modulator and a SPT-adaptive-biasing network. This work supports 5G FR2 standard with 400MHz channel bandwidth

and has a 30% improved efficiency compared to the average power tracking operation.

• In Paper 9.3, University of California, San Diego presents a GaN monolithic integrated adjustable supply modulator for digital envelope tracking

transmitters in 5G base stations. The implemented circuit supports 4 tunable voltage levels and improves the power-added efficiency of a

transmitter with a rated power of 74W/48V by 6.9%.

• In Paper 9.4, University of Science and Technology of China presents a symbol-power-tracing (SPT) supply modulator for 5G NR power amplifiers.

With a ripple-cancellation technique and a small load capacitance, it achieves fast up- and down-tracking of 510ns/5V and 900ns/5V, and <12mV

output voltage ripple.

• In Paper 9.5, Sogang University presents a sub-1V PMOS analog low dropout regulator (LDO). The LDO with a rail-to-rail pseudo impedance

buffer (RRPB) and a load-independent Gm-boosting cell (LIGC) supports very wide load current from 0 to 1.2A, input voltage from 0.7 to 1.4V and

dropout voltage from 0.05 to 0.2V while maintaining the phase margin.

• In Paper 9.6, Nanjing University presents a 6.78MHz single-stage dual-output regulating rectifier for a biomedical wireless powering. By supporting

a simultaneous charging of dual outputs in a half cycle and a charge distribution operation, it achieves 92.2% peak efficiency with a negligible

load transient response and an unnoticeable cross-regulation.

• In Paper 9.7, Xi’an JiaoTong University presents a Class-E transmitter for a wireless power transfer system. With an adaptive real-part impedance-

matching technique and an imaginary-part phase compensation controller, the transmitter achieves high transmitter efficiencies of 88.0 to 94.2%

for very wide load range from 3 to 29W, and a maximum E2E output power of 33W.

• In Paper 9.8, Samsung Electronics presents a 50W high-power wireless charging system with an acoustic noise-reduced ASK modulation

technique and an internal hybrid voltage-/current-mode ASK demodulator. This work significantly reduces an acoustic noise of MLCC capacitors

up to 20dB and eliminates all related external components of the demodulator.

• In Paper 9.9, University of Macau presents a bi-directional dual-path boost-48V-buck hybrid converter for a high-voltage power transmission cable

in light-weight humanoid robots. By reusing a parasitic cable inductor and employing a phase alignment scheme, the converter achieves a peak

efficiency of 96.7% at 16.5W and a maximum transmission power of 45W without requiring an additional bulky power inductor.

• In Paper 9.10, Southern University of Science and Technology presents a hybrid boost converter with continuous input and output currents. By

placing an inductor between two switched-capacitor step-up stages, the converter has a significantly reduced inductor current at a high voltage

conversion ratio and a fast transient response due to the elimination of a right-half-plane zero. It delivers a maximum output power of 10.8W with

a peak efficiency of 93% and a high power density of 49.5mW/mm3.

• In Paper 9.11, University of Science and Technology of China presents a single-mode hybrid buck-boost converter for Li-ion battery management.

The converter has a half-Vin voltage stress and a continuous output current delivery using 7 switches and 2 flying capacitors, and it achieves

98.3% peak efficiency and 7mV output voltage ripple without mode transition issues.

Session 9 Highlights: Ubiquitous Power Delivery

[9.1] An 85-230VAC to 3.3-4.6VDC 1.52W Capacitor-Drop Sigma-Floating-SC AC-DC

Converter with 81.3% Peak Efficiency

[9.2] A 400MHz Symbol-Power-Tracking (SPT) Supply Modulator with SPT-Adaptive-

Biasing Network Supporting 5G FR2 CMOS PA

Paper 9.1 Authors: Fei Song1, Shousheng Han1,2, Rui P. Martins1, Yan Lu1,2

Paper 9.1 Affiliation: 1University of Macau, Macao, China, 2Tsinghua University, Beijing, China

Paper 9.2 Authors: Jongbeom Baek, Ali Niknejad

Paper 9.2 Affiliation: University of California, Berkeley, CA

Subcommittee Chair: Bernhard Wicht, University of Hannover, Germany, Power Management

CONTEXT AND STATE OF THE ART

● A merged topology of a sigma-SC rectifier and a floating-SC converter improves output power and efficiency by the three

energy flow paths to the output capacitor.

● A non-uniform multi-level supply modulator and a SPT-adaptive-biasing network optimize the supply voltage and operating

bias point of the mmWave CMOS RF power amplifier and improve its efficiency and linearity performances.

TECHNICAL HIGHLIGHTS

● University of Macau presents a capacitor-drop sigma-floating-SC AD-DC converter with a sigma-SC rectifier and a

floating-SC converter for IoT and smart home devices.

o The AD-DC converter generates a 3.3-4.6VDC output from an 85-230VAC input with a maximum output power of

1.52W and a peak efficiency of 81.3%.

● University of California, Berkeley presents a fully integrated symbol-power-tracking (SPT) CMOS RF power amplifier

with a non-uniform multi-level supply modulator and a SPT-adaptive-biasing network.

o The SPT CMOS RF power amplifier supports 5G FR2 standard with 400MHz channel bandwidth and has a 30%

improved efficiency compared to the average power tracking operation.

APPLICATIONS AND ECONOMIC IMPACT

● The AC-DC converter with high efficiency and small form factor enhances the cost and power competitiveness of IoT and

smart home devices.

● The SPT CMOS RF power amplifier improves the performance of 5G FR2 communication in terms of data rate and power

consumption.

Session 10 Overview: Transceiver Chipsets for

Communication and Radar

Wireless Subcommittee

Session Chair: Matteo Bassi, Infineon Technologies, Villach, Austria

Session Co-Chair: Shahriar Shahramian, Nokia – Bell Labs, New Providence, NJ

Integrated circuits continue to advance the performance of wireless applications from a few megahertz to hundreds of gigahertz. Wireless

communication, radar chipsets & ultra-low-power transceivers are ubiquitous in our industry. This session presents several key

innovations in such domains.

• In Paper 10.1, East China Normal University demonstrates a 77GHz TDMA-MIMO Phased-Array radar implemented in a 55nm CMOS process.

It includes 8TX and 2× 4RX elements with 32dBm EIRP and 16.8dB Noise Figure while providing chirp bandwidth of 8GHz.

• In Paper 10.2, Tianjin University presents a D-Band 4TX-4RX FMCW transceiver array with Antenna-in-Package leveraging an LTCC substrate.

The module achieves a maximum of 28dBm EIRP, minimum Noise Figure of 7.8dB and up to 24GHz of chirp bandwidth.

• In Paper 10.3, ETH Zurich presents a reflect-array operating in D-Band leveraging orthogonally polarized on-chip antennas. The 4×4 array chipset

includes angle-of-arrival detection demonstrating a non-line-of-sight 6Gb/s wireless link at 1.3m.

• In Paper 10.4 Samsung Electronics presents a dual transceiver IR-UWB chipset with shared antennas that is compliant with IEEE 802.15.4/4.

The IC supports channels 5 to 12 with a peak power of 14dBm and a Noise Figure of 4.2dB with a transformer-based LC resonator at the TX/RX

shared ports.

• In Paper 10.5, XINYI Information Technology and XINYI Semi present a low-cost (QFN) 28nm RF transceiver supporting LTE Cat1bis for IoT

applications. The low-power IC consumes 40mW in receive mode and 113mW in TX mode covering 0.6 to 2.7GHz. The receiver achieves an

FDD/TDD sensitivity of better than -99dBm.

Session 10 Highlights: Transceiver Chipsets for

Communication and Radar

[10.2] A 132-to-148GHz CMOS 4TX-4RX FMCW Radar Transceiver Array with Cavity-

Backed Antenna-in-Package Achieving 28dBm EIRP

[10.3] A D-Band 2-D Scalable 4×4 Active Reflective Relay with Orthogonally-Polarized

On-Chip TX/RX Antennas and In-Front-End Common-Centroid Fast Azimuth/Elevation

Angle-of-Arrival Detection

Paper 10.2 Authors: Bing Liu, Jiancheng huang, Zheng Yang, Xuguang Li, Jing Zhang, Xu Wang, Hao Shi, Zhenghua Xu, Fei Li,

Ruipeng Liu, Shuangxu Li, Yongqiang Wang, Keping Wang, Haipeng Fu, Fanyi Meng, Kaixue Ma

Paper 10.2 Affiliation: Tianjin University, Tianjin, China

Paper 10.3 Authors: Basem Abdelaziz Abdelmagid, Boce Lin, Hua Wang

Paper 10.3 Affiliation: ETH Zurich, Zurich, Switzerland

Subcommittee Chair: Chih-Ming Hung, Taiwan, Taipei, Wireless

CONTEXT AND STATE OF THE ART

● With the increasing demand for data rates, the mm-wave and sub-THz spectrums have been actively explored for future high-

resolution radars and high-data-rate wireless communications.

● The high free-space path loss (FSPL) and the challenges related to establishing non-line-of-sight (NLOS) communication links

limit the effectiveness of communication and radar systems at those frequencies.

● Innovations in architectures and circuit techniques for transceivers, including packaging, help advance the performance barrier

with robustness.

TECHNICAL HIGHLIGHTS

● Tianjin University presents a 132-to-148GHz 4TX-4RX FMCW radar transceiver array that achieves state-of-the-art EIRP.

o A cavity-backed antenna-in-package (AiP) 4-transmitter and 4-receiver FMCW array achieves a remarkable 28dBm

equivalent isotropic radiated power (EIRP) without additional lenses in CMOS technology. The design incorporates a

high-power amplitude-phase harmonic modulated quadrupler, a low-loss power-combiner-based high-power power

amplifier (PA), and a cavity-backed AiP, enabling high-resolution radar and 3D imaging capabilities.

● ETH Zurich presents a D-Band active reflective relay with on-chip TX/RX orthogonally-polarized antennas.

o Non-line-of-sight (NLOS) communication is demonstrated with successful data transmission rates of 2.4 to 4.8Gb/s

over a 1.3-meter round-trip distance without prior knowledge of the incident beam's AoA. This is achieved with novel

design of a 120GHz 2-D scalable active reconfigurable reflective relay, which incorporates orthogonally polarized on-

chip 4×4 TXRX antennas and in-front-end distributed common-centroid mixed-signal fast angle-of-arrival (AoA)

detection.

APPLICATIONS AND ECONOMIC IMPACT

● Advancements in architectures, circuits and packaging techniques for mm-wave enable the development of new products and

services, such as high-capacity wireless networks, advanced radar systems, and medical imaging devices.

● Millimeter-wave technologies can improve the efficiency of existing industries by providing faster data transfer rates, higher

resolution imaging, and more accurate sensing capabilities.

Session 11 Overview:

RF and mm-Wave Wireless Receivers

Wireless Subcommittee

Session Chair: Negar Reiskarimian, Massachusetts Institute of Technology, Cambridge, MA

Session Co-Chair: Hao Gao, Eindhoven University of Technology, Netherlands

This session presents five papers on various techniques for improving RF and mm-wave wireless receiver performance. The first paper

demonstrates a phased-array RX capable of receiving concurrent right- and left-handed circular polarizations. Papers 2 and 5 focus on

techniques to combat interference in RF and mm-wave receivers, one using non-uniform multi-level time-approximation filtering and

another using a higher-order TIA. Paper 3 demonstrates a wideband full-duplex receiver, and Paper 4 focuses on a low-power GNSS

receiver for positioning.

• In Paper 11.1, the Institute of Science Tokyo and Axelspace present a 256-element Ka-band phased-array receiver using switch-type quadrature-

hybrid-first architecture supporting single/dual linear or circular polarizations. The receiver consumes 4.6/9.1W while receiving single/dual circular

polarizations with up to 8Gb/s data rate.

• In Paper 11.2, the University of Southern California and the University of Waterloo present a blocker-tolerant receiver featuring a non-uniform

VCO-based multi-level time-approximation filter. Operating at 32GHz, the 28nm CMOS prototype achieve maximum 65dB rejection supporting

150MHz 256-QAM modulation with EVM of -36/-31dB in the presence of a -20/+24dBc blocker.

• In Paper 11.3, Fudan University presents a 0.5-to-5GHz full-duplex RX with Hilbert transform cancellation in RF/BB based on multi-stage all pass

filters, achieving >37dB on-chip cancellation over 80-to-120MHz BW.

• In Paper 11.4, Southeast University and the Nanjing Low Power IC Technology Institute present a 28nm 0.47mm2 GNSS receiver. A 0.65V Gm-

C signal-quadrature RF front-end is embedded within a passive mixer, allowing the receiver to achieve a 3.2dB NF, -164dBm tracking sensitivity

and -148dBm acquisition sensitivity at 2mW power consumption.

• In Paper 11.5, Fudan University presents a 0.5-to-3GHz blocker-tolerant RX front-end with high linearity (OOB IIP3 of 14.4dBm and 19dBm for

offset/BW = 1 and 2, respectively) and selectivity (-100dB/decade far-OOB roll-off), for wideband applications (0.5 to 3GHz) at

71.5mW+11.7mW/GHz.

Session 11 Highlights:

RF and mm-Wave Wireless Receivers

[11.5] A 200MHz-BW Blocker-Tolerant Receiver with Fifth-Order Filtering Achieving

19dBm Adjacent-Channel IIP3

Paper 11.5 Authors: Liangbo Lei, Yanxiang Chen, Yijie Li, Zhiliang Hong, Yumei Huang

Paper 11.5 Affiliation: Fudan University, Shanghai, China

Subcommittee Chair: Chih-Ming Hung, MediaTek, Taiwan

CONTEXT AND STATE OF THE ART

• 5G radios need to operate over a multitude of closely spaced bands, particularly at sub-7GHz frequencies, enabling global

spectrum compatibility.

• These radios should operate over a wide frequency range with increased bandwidth while rejecting strong blockers and

interferers both within (as close as adjacent channel) and outside the band.

TECHNICAL HIGHLIGHTS

• Fudan University introduces a 0.5-to-3GHz interference-tolerant radio front-end using a first-order N-path notch filter

in parallel with the LNTA and a fourth-order TIA to achieve fifth-order filtering.

o The 40nm IC adopts a multi-gate average biasing technique within the LNTA to enhance IIP3. The proposed TIA

architecture enables a steep close-in selectivity. The RX achieves OOB IIP3 of 19dBm for offset/BW = 2 and -100

dB/decade far-OOB roll-off.

APPLICATIONS AND ECONOMIC IMPACT

• With the expansion of 5G wireless networks the need for interference-resilient radios both in-band and out-of-band becomes

more and more significant.

• Demands for higher data rates impose stringent design requirements on the RF receivers in terms of BW, power consumption

and linearity.

Session 12 Overview:

Innovations from Outside the (ISSCC) Box

Technology Directions Subcommittee

Session Chair: Kaushik Sengupta, Princeton University, Princeton, NJ

Session Co-Chair: Firooz Aflatouni, University of Pennsylvania, Philadelphia, PA

There have been numerous new and important developments outside the circuit community that can have profound impact on the solid-

state circuits society, either through emergence of new applications, or, via offering of new platforms for processing, communications,

and sensing. This invited session seeks to provide exposure to a few of such developments for the ISSCC audience who may not have

day-to-day interactions with such subjects in the hope of inspiring new ways of thinking in circuit design and collaboration opportunities.

• In Paper 12.1, Berkeley presents an overview of principle of computation for artificial Intelligence and optimization based on Onsager’s Principle

of minimum entropy generation, referred to as Onsager Computing as opposed to conventional Von Neumann Computing. Electrical Onsager

computers is an emerging computing paradigm with potential for drastic reduction of time and energy to solution compared to conventional

machines. The paper presents several inequalities in physics as opportunities to be utilized for energy-efficient computing.

• In Paper 12.2, Purdue presents a new paradigm of computation, which is neither analog nor digital, titled the ‘p-circuit’. The p-circuit is a stochastic

computing element, and not a deterministic Boolean function of the inputs. The output of the device is a random binary variable whose probability

of being 1 is given by an analog function of the inputs. Utilizing such devices, new computing architectures can emerge that can enable orders of

magnitude reduction in energy for computing, optimization and learning problems.

• In paper 12.3, Caltech presents methodologies for optical imaging through deep tissues. The high optical turbidity of biological tissues prevents

scientists and clinicians from performing deeply penetrating high resolution optical imaging through humans and animal models alike. The paper

presents significant advances in wavefront shaping that have allowed time-reversing scattering and enabling focusing of light deep through

biological tissue for imaging and other applications. The paper also presents opportunities for integrated optoelectronic solutions for transformative

impact.

• In Paper 12.4, Stanford presents advances in e-skin technologies for developing electronic materials inspired by skin’s properties, such as

stretchability, self-healing ability and biodegradability. This class of new active materials can enable skin-like integrated circuits for neuromorphic

signal processing to generate spike-train signals. The paper presents broad opportunities for such skin-like sensors and integrated circuits for

applications in medical devices, robotics and wearable electronics.

Session 12 Highlights:

TBA

Session 13 Overview: Cool Computation Circuits

Technology Directions Subcommittee

Session Chair: Joseph Bardin, Google & UMass Amherst, USA

Session Co-Chair: Shawn S. H. Hsu, National Tsing Hua University, Hsinchu, Taiwan

This session focuses on cool design techniques for unconventional AI accelerators and cryogenic controllers for quantum computers.

The first two papers present high-accuracy high-dynamic-range voice activity detection and spoken language understanding SoCs

enabled by analog feature extractions. The next three papers describe cryo-CMOS ICs for ion-trap quantum processors and

semiconductor quantum devices. The final paper presents a low-cost energy-efficient DNN processor with via-programmable neuron

arrays.

• In Paper 13.1, Peking University, Advanced Institute of Information Technology of Peking University, and Nano Core Chip Electronic Technology

demonstrate a voice activity detection (VAD) system achieving 84%~98% accuracy across 0~15dB SNR at 161nW power. The system employs

an analog-domain information-aware data compressor that extracts extreme values of the raw signal to save the power of ADC and neuromorphic

temporal-spatial feature extractor.

• In Paper 13.2, University of Zurich/ETH Zurich and Aalto University present an SoC for end-to-end user intent understanding with continuous

speech input. The 65nm prototype exploits direct analog-to-feature conversion, delta-activation sparsity, and temporal pooling, to achieve >85%

32-class spoken language understanding (SLU) accuracy over 75dB input range at 8.62μW. Its analog feature extractor with global and channel-

level AGC boosts the dynamic range.

• In Paper 13.3, TU Braunschweig, Leibniz University, and Keio University present a cryo-BiCMOS 130nm IC for the control of single-qubit gates

on 9Be+ trapped ions, with the controller achieving 99.97% fidelity while dissipating 21.5mW per qubit.

• In Paper 13.4, University of Electronic Science and Technology of China presents a cryo-CMOS 65-nm controller adopting two 18-bit R-2R DACs

to accurately bias the terminals of a Single Electron Transistor (SET). Operating at 60mK with a total chip power dissipation of 60μW, each of the

DAC can ensure 0.8b/0.8b INL/DNL with a 4.5μV LSB.

• In Paper 13.5, CEA-Léti, CEA-List, CEA-Pheliqs, and Quobly present a cryo-CMOS readout IC for semiconductor spin qubits fabricated in a 22nm

FDSOI process, which exploits QAM modulation to multiplex several charge sensors for qubits sensing through the same front-end. Operated at

4.2K, the circuit dissipates 37.2μW to read out the conductance of 2 SETs with a BER=10-3 in a 5.5µs integration time.

• In Paper 13.6, University of Tokyo presents a via-programmable neuron array (VPNA) technology for low-cost, energy-efficient DNN processor

solutions for AI-IoT, where a prefabricated base chip can be configured for diverse DNN layers by adjusting the placement of vias. The proposed

bit- and neuron-serial circuit and sparse binary weighted DNN techniques reduce the chip area and the required number of tiles.

Session 13 Highlights: Cool Computation Circuits

[13.1] A 0.22mm2 161nW Noise-Robust Voice Activity Detection Using Information-

Aware Data Compression and Neuromorphic Spatial-Temporal Feature Extraction

[13.2] A Cryo-BiCMOS Controller for 9Be+ Trapped-Ion-Based Quantum Computers

Paper 13.1 Authors: Ying Liu*1, Jie Li*1, Qining Zhang*1, Tianhao Zhao2, Chenhao Shi1, Ninghui Shang1, Yufei Ma1, Zhixuan Wang3,

Ru Huang1, Linxiao Shen1, Le Ye1,2

Paper 13.1 Affiliation: 1Peking University, Beijing, China, 2Advanced Institute of Information Technology of Peking University,

Hangzhou, China, 3Nano Core Chip Electronic Technology, Hangzhou, China

Paper 13.2 Authors: Peter Toth1, Paul Eugine Shine1, Sebastian Halama2, Yerzhan Kudabay1, Kaoru Yamashita1,3, Hiroki Ishikuro3,

Christian Ospelkaus2, Vadim Issakov

Paper 13.2 Affiliation: Technische Universität Braunschweig, Braunschweig, Germany, 2Leibniz University Hannover, Hannover,

Germany, 3Keio University, Yokohama, Japan

Subcommittee Chair: Ali Hajimiri, Caltech, Pasadena, CA

CONTEXT AND STATE OF THE ART

● AI-enabled audio processing is widely demanded in the IoT space. Conventional digital processors are power-hungry for

such always-on applications, while analog-to-information front ends with algorithm-circuit co-design offer a promising

solution to circumvent the energy bottleneck while keeping sufficient accuracy.

● Cryogenic quantum processors are typically controlled by room-temperature electronics. However, when scaling up to the

large number of quantum bits (qubits) required for practical applications, wiring to such room-temperature controllers may

become the bottleneck.

TECHNICAL HIGHLIGHTS

● Peking University, Advanced Institute of Information Technology of Peking University, and Nano Core Chip

Electronic Technology extract extreme points from raw audio signals in the analog domain to achieve 2.6 data

reduction while keeping 97% information, which is followed by a neuromorphic temporal-spatial feature

extractor.

o The SoC achieves state-of-the-art 84%~98% voice activity detection (VAD) accuracy across 0~15dB input SNR, at

merely 161nW power consumption.

● TU Braunschweig, Leibniz University, and Keio University demonstrate a cryogenic controller for ion-trap-based

quantum computers.

o The BiCMOS 130nm cryogenic IC demonstrates a control fidelity of 99.97% when driving a single qubit gate on a

9Be+ trapped ion, which is on par with state-of-the-art room temperature controllers.

APPLICATIONS AND ECONOMIC IMPACT

● Always-on intelligent audio processing at an extremely low power budget facilitates interactions between humans and

electronic devices, advancing the frontier of AI IoT applications.

● Cryogenic controllers for quantum processors break the wiring bottleneck, hindering quantum-computer scaling, thus

enabling the control of the large-scale quantum processors needed for practical applications.

Session 14 Overview: Compute-In-Memory

Memory Subcommittee

Session Chair: Saekyu Lee, IBM Research, Yorktown Heights, USA

Session Co-Chair: Xueqing Li, Tsinghua University, Beijing, China

Memory access has been a major system-performance and energy-consumption bottleneck for traditional von Neumann system

architectures. Compute-in-memory (CIM) architectures eliminate this bottleneck by integrating compute operations into the memory array,

to reduce memory access latency and data movement overhead. Innovations in CIM design continue to improve energy and area

efficiencies while maintaining overall AI network accuracy. This session includes 7 papers showcasing the latest developments in gain-

cell, SRAM, and non-volatile CIM. Featured innovations include the 1st demonstration of a microscaling data format and STT-MRAM

based Bayesian neural network.

• In Paper 14.1, National Tsing Hua University and TSMC present an STT-MRAM CIM macro for noise-tolerant Bayesian neural networks with a

heterogeneous in- and near-memory MAC structure. The 22nm macro achieves 104.5TOPS/W with a 0.03% accuracy loss for CIFAR-100.

• In Paper 14.2, TSMC and National Tsing Hua University demonstrate the first CIM macro demonstrating the microscaling data format; achieving

133.5TFLPOS/W in a 16nm process.

• In Paper 14.3, Southeast University presents a 28nm floating-point macro with an adaptive-alignment scheme and non-2’s-complement MAC

achieving 62.84TFLOPS/W.

• In Paper 14.4, Tsinghua University presents a CIM macro targeting compound AI models with static/dynamic sparsity-aware acceleration and

post-CIM alignment to reduce error rates. The 28nm macro demonstrates 51.6TFLOPS/W for FP16 operations.

• In Paper 14.5, IME-CAS demonstrates a digital transpose CIM macro for floating-point training and inference; targeting edge-AI applications with

accurate/approximate dual-mode MAC operations. It achieves 48.08TFLOPS/W and 2.34TFLOPS/mm2 for BF16 operations in 28nm CMOS.

• In Paper 14.6, Southeast University presents a hybrid-CIM macro that implements a bit-rotated feature-in scheme for efficient compute allocation

between digital and analog domains. The macro achieves 67.8TOPS/W for INT8 operations in 28nm CMOS.

• In Paper 14.7, Southeast University presents a mimetic-path-searching CIM macro that demonstrates dual-direction searching to achieve a 3670M

nodes/s search rate and a 69.4fJ/node energy consumption.

Session 14 Highlights: Compute-in-Memory

[14.1] A 22nm 104.5TOPS/W μ-NMC-Δ-IMC Heterogeneous STT-MRAM CIM Macro for

Noise-Tolerant Bayesian Neural Networks

[14.2] A 16nm 216kb, 188.4TOPS/W and 133.5TFLOPS/W Microscaling Multi-Mode Gain-

Cell CIM Macro for Edge-AI Devices

Paper 14.1 Authors: De-Qi You1, Win-San Khwa2, Bo Zhang3, Fang-Yi Chen1, Andrew Lee1, Yu-Cheng Hung1, Yi-Ming Li1, Yu-Hui

Wang1, Chung-Chuan Lo1, Ren-Shuo Liu1, Kea-Tiong Tang1, Chih-Cheng Hsieh1, Yu-Der Chih4, Tsung-Yung Chang4, Meng-Fan

Chang1,2

Paper 14.1 Affiliation: 1National Tsing Hua University, Hsinchu, Taiwan, 2TSMC Corporate Research, Hsinchu, Taiwan, 3TSMC

Corporate Research, San Jose, CA, 4TSMC, Hsinchu, Taiwan

Paper 14.2 Authors: Win-San Khwa1, Ping-Chun Wu2, Jian-Wei Su2,3, Chiao-Yen Cheng2, Jun-Ming Hsu2, Yu-Chen Chen2, Le-Jung

Hsieh2, Jyun-Cheng Bai2, Yu-Sheng Kao2, Tsung-Han Lou2, Ashwin Sanjay Lele4, Jui-Jen Wu1, Jen-Chun Tien2, Chung-Chuan Lo2,

Ren-Shuo Liu2, Chih-Cheng Hsieh2, Kea-Tiong Tang2, Meng-Fan Chang1,2

Paper 14.2 Affiliation: 1TSMC Corporate Research, Hsinchu, Taiwan, 2National Tsing Hua University, Hsinchu, Taiwan, 3Industrial

Technology Research Institute, Hsinchu, Taiwan, 4TSMC Corporate Research, San Jose, CA

Subcommittee Chair: Meng-Fan Chang, National Tsing Hua University (NTHU), Taiwan

CONTEXT AND STATE OF THE ART

● Compute-in-memory has emerged as a power-efficient and area-efficient computing paradigm for neural-network acceleration

and other data-intensive applications for both cloud and edge scenarios.

● The AI-workload evolution requires higher accuracy and data format compatibility using lower area and power. Progress is

hindered by data-format conversion, adder-tree implementation, device and circuit non-idealities, and system-to-macro data

transfers.

● Innovations highlighted in this session enable reliable and high-efficiency computation in non-volatile STT-MRAM and volatile

gain-cell arrays with enhanced accuracy and data-format compatibility.

TECHNICAL HIGHLIGHTS

● NTHU and TSMC introduce an STT-MRAM-based CIM macro capable of performing noise-tolerant Bayesian neural-

network acceleration

o A μN-ΔI-CIM macro based on 22nm STT-MRAM achieves a 104.5TOPS/W power efficiency in BNN-INT8 mode with

a 0.03% accuracy loss for CIFAR-100 tasks.

● TSMC and NTHU present a microscaling integer/floating-point multi-mode gain-cell CIM macro capable of ultra-

power-efficient edge-AI acceleration.

o A 16nm 216kb multi-mode gain-cell CIM macro demonstrates a 133.5TFLOPS/W peak energy efficiency for MXINT8

inputs and MXINT8 weights, and a 91.9TFLOPS/W for BF16 inputs and BF16 weights with FP32 outputs.

APPLICATIONS AND ECONOMIC IMPACT

● The current commercial-AI hardware implementations are consuming significant amounts of power and silicon area; thus,

making AI-accelerator implementations more efficient is critical for expanding to new application scenarios.

● Compute-in-memory circuits and architectures enable low-power consumption and reduced manufacturing costs, while

preserving computing accuracy and reliability for emerging-AI workloads in various scenarios.

● AI compute-in-memory requirements will boost the development of fundamental volatile and non-volatile memory

technologies.

Session 15 Overview: Neural Interfaces and Edge

Intelligence for Medical Devices

Imagers, Medical and Displays Subcommittee

Session Chair: Azita Emami, California Institute of Technology, USA

Session Co-Chair: Taekwang Jang, ETH Zurich, Switzerland

Neural recording and decoding circuits continue to improve in terms of precision, robustness, and energy efficiency. The first three papers

present edge-computing SoCs for decoding the neural signals with high efficiency and accuracy. The next paper introduces an SoC with

advanced wireless communication and power telemetry capability. The final three papers describe reconfigurable, noise-efficient and

artifact tolerant neural interface circuits.

• In Paper 15.1, National Taiwan University presents a speech-decoding processor that supports a communication rate of up to 200words/min with

power consumption of 3.9mW.

• In Paper 15.2, Delft University of Technology introduces a spike-sorting chip that processes 1,024-channel neural signals with area and energy

efficiencies of 0.00029mm2/ch and 74nW/ch, respectively.

• In Paper 15.3, University of Notre Dame shows an uncertainty-quantifiable Bayesian convolutional neural network accelerator for ventricular

arrhythmia detection achieving 1.7µJ/inference.

• In Paper 15.4, University of Toronto presents a wireless neuroprosthetic SoC supporting simultaneous wireless power transfer and data link

(200Mb/s Tx, 60Mb/s Rx) with a resonant frequency splitting link. It also introduces an unsupervised adaptive clustering approach with 1.6µW/ch

multi-class spike sorting.

• In Paper 15.5, University of Toronto describes an event-based reconfigurable neural interface IC that offers three modes of 1b inverter-based

spike detector, zoomed active electrode, and high dynamic range interface, for high energy efficiency, high precision, and artifact tolerance,

respectively.

• In Paper 15.6, ETH Zurich introduces a direct-digitization neural front-end proposing delta-amplification, which improves both noise efficiency and

dynamic range, achieving an NEF of 3.47 and an FoMs of 175.2dB.

• In Paper 15.7, Ulsan National Institute of Science and Technology presents an AFE for two-electrode bio-potential recording amplifier with

common-mode interference tolerance of 133V using a common-mode interferer follower.

Session 15 Highlights: Precision neural and cardiac chips

[15.2] A 1024-Channel 0.00029mm2/ch 74nW/ch Online Spatial Spike-Sorting Chip with

Event-Driven Spike Detection and Self-Organizing Map Clustering

[15.3] A 65nm Uncertainty-Quantifiable Ventricular Arrhythmia Detection Engine with

1.75μJ per Inference

[15.7] A 4.6μW 2.6-NEF Biopotential Amplifier with 133VPP Common-Mode Interference

Tolerance and 102dB Total Common-Mode Rejection Ratio for Two-Electrode

Recording System

Paper 15.2 Authors: Arash Akhoundi1, Yawende Landbrug1, Pumiao Yan2, E. J. Chichilnisky2, Boris Murmann3, Dante Gabriel

Muratore1

Paper 15.2 Affiliation: 1Delft University of Technology, Delft, The Netherlands, 2Stanford University, Stanford, CA, 3University of

Hawaii, Honolulu, HI

Paper 15.3 Authors: Jianbo Liu, Zephan Enciso, Boyang Cheng, Likai Pei, Steven Davis, Yifan Qin, Zhenge Jia, Xiaobo Sharon Hu,

Yiyu Shi, Ningyuan Cao

Paper 15.3 Affiliation: University of Notre Dame, Notre Dame, IN

Paper 15.7 Authors: Yongjae Park1, Yeong-Jin Mo2, Jeong-Hoon Kim3, Gert Cauwenberghs3, Seong-Jin Kim2

Paper 15.7 Affiliation: 1Ulsan National Institute of Science and Technology, Ulsan, Korea, 2Sogang University, Seoul, Korea,

3University of California, San Diego, CA

Subcommittee Chair: Rikky Muller, University of California, Berkeley, CA

CONTEXT AND STATE OF THE ART

● Next-generation brain-computer interfaces require high-density multi-electrode arrays that produce massive amounts of raw

data that must be reduced on-chip to enable wireless operation.

● Although deep-learning methods have improved Ventricular Arrhythmia detection, they suffer from unquantified uncertainties

that limit their reliability in critical medical decisions and impede widespread adoption in trustworthy smart health applications.

● A two-electrode recording IC suffers from large common-mode interferences, potentially saturating an analog front-end (AFE)

or resulting in large common-mode to differential-mode conversion.

TECHNICAL HIGHLIGHTS

● Delft University introduces a spike-sorting chip with >10 lower power and area and >3 more channels than

previous designs

o A 1024-ch spike sorting chip that achieves 74nW/ch, 0.00029mm2/ch and >87% accuracy by using event-driven

spike detection and spatial features.

● University of Notre Dame presents the first end-to-end uncertainty-quantifiable Bayesian Convolutional Neural

Network accelerator for ventricular arrhythmia detection

o This chip enables fully parallel in-memory analog GRNG and vector-matrix multiplication operations and achieves

100% VA detection accuracy, 1.75μJ/inference energy and 360fJ/sample GRNG.

● Ulsan National Institute of Science and Technology presents an AFE for two-electrode bio-potential recording

systems with very high common-mode interference (CMI) tolerance and common-mode rejection

o The proposed CMI-Follower allows the common-mode input source itself to drive the floating chip ground,

dynamically suppressing the CMI up to 133VPP with a power consumption of 4.6μW.

APPLICATIONS AND ECONOMIC IMPACT

● Energy-efficient and multi-channel spike sorting enables significant data reduction and low-latency close-loop neural interfaces

that have the potential to improve motor and speech decoding devices.

● The proposed detection engine facilitates low-power and uncertainty-quantification-enabled Ventricular Arrhythmia detection,

ensuring reliable performance under out-of-distribution data, hardware imperfections and temperature variations.

● An AFE that suppresses large CMIs and achieves high CMRR and low power is crucial for mobile healthcare applications

where power efficiency and CMI resilience are essential.

Session 16 Overview: Highlighted Chip Releases:

Digital and Machine Learning Processors

Invited Industry Session

Session Chair: Alicia Klinefelter, NVIDIA, Durham, NC

Session Co-Chair: Vivek De, Intel, Hillsboro, OR

This session highlights recent, productized advancements in digital processors and systems for AI acceleration and high-performance

networking. Key innovations include novel architectures for AI processing, sophisticated memory architectures, and advanced switch

chip designs. The work presented in this session demonstrates significant improvements in AI capabilities, power efficiency, and

scalability across various applications, from mobile devices to data centers. The session provides an overview of current state-of-the-art

hardware solutions addressing the demands of modern computing workloads.

• In Invited Paper 16.1, Broadcom presents Tomahawk5, a switch chip for data centers and AI environments, offering 51.2Tb/s switching capacity.

The chip includes 512 lanes of 106.4Gb/s PAM4 SerDes, supports multiple I/O configurations, has a 168MB shared-buffer architecture, and is

designed as a monolithic die with 9,352 pins and six ARM processor cores, achieving 450W typical power consumption in actual deployments.

• In Invited Paper 16.2, FuriosaAl presents their 5nm inference chip that utilizes parallelism and data locality inherent in tensor contraction, featuring

HBM3 memory, slice redundancy, and a high-bandwidth NoC. It delivers 256TFLOPS/512TOPS for BF16/INT8 with 256MB of on-chip SRAM,

and achieves 12.3 queries/s at 99% accuracy for GPT-J.

• In Invited Paper 16.3, Samsung introduces their Exynos 2400 SoC, used in the Galaxy S24 smartphone. It features a 4nm EUV process and

supports on-device AI with a heterogeneous NPU architecture consisting of vector engines and two types of tensor engines. The 4nm SoC

achieves improved AI performance through better heat dissipation using fan-out wafer level packaging, with the NPU occupying 12mm² and

operating at 0.55 to 0.83V and 533 to 1196MHz.

• In Invited Paper 16.4, SambaNova discusses their SN40L, a Reconfigurable Dataflow Unit for AI applications. It features a three-tier memory

system and dual accelerator dies on a CoWoS-S interposer. Each socket delivers 640 BF16 TFLOPs, and includes 520MB on-chip SRAM. The

SN40L’s architecture enables improved energy efficiency and performance capable of hosting up to 5 trillion parameter models.

Session 16 Highlights: Highlighted Chip Releases:

Digital and Machine Learning Processors

[16.1] Tomahawk5: 51.2Tb/s 5nm Monolithic Switch Chip for AI/ML Networking

Paper Authors: Asad Khamisy1, Mohan Kalkunte1, Peter Del Vecchio1, Yokai Cheok1, Greg Barsky1, Karlheinz Muth1, Reza Sharifi2

Paper Affiliation: 1Broadcom, San Jose, CA, 2Broadcom, Irvine, CA

CONTEXT AND STATE OF THE ART

Advancements in switch chip technology address the growing demands of data centers and AI applications for higher bandwidth,

lower latency, and improved power efficiency. The innovations in chip design, packaging, and optical integration are key steps

forward in the evolution of high-performance networking infrastructure.

High-speed links in advanced chip designs face significant signal integrity challenges, including isolation issues, insertion loss, and

return loss, which are exacerbated by traditional BGA patterns and ball pitches. Approaches such as custom BGA patterns

and advanced packaging techniques are required.

TECHNICAL HIGHLIGHTS

Broadcom's Tomahawk5 is a 51.2Tb/s Ethernet switch chip featuring 512 lanes of PAM4 SerDes, support for various

connectivity options including co-packaged optics, and innovative power-saving techniques, all implemented in a

monolithic, 5nm die.

Achieves low power consumption through various techniques, including multi-bin Adaptive Voltage Scaling (AVS). It

typically consumes about 450W, representing a 30% power reduction from the previous generation.

Integrates Broadcom's Peregrine SerDes technology, providing native support for multiple physical connectivity options

including direct attach copper (DAC), front panel pluggable optics, linear pluggable optics (LPO), and co-packaged

optics (CPO). This allows data center operators to choose the most cost-effective connectivity solutions.

APPLICATIONS AND ECONOMIC IMPACT

The high bandwidth and advanced features of Tomahawk 5 are specifically designed to handle AI and machine learning traffic

patterns. This allows data centers to more efficiently run these in-demand and compute-intensive workloads.

A single Tomahawk5 chip can replace multiple previous-generation switches, allowing data centers to consolidate equipment. This

reduces networking hardware costs. The increased switching capacity and reduced power also translates to improvements in

energy efficiency and data center cost savings.

Session 16 Highlights: Highlighted Chip Releases:

Digital and Machine Learning Processors

[16.2] RNGD: A 5nm Tensor Contraction Processor for Power-Efficient Inference on

Large Language Models

Paper Authors: Sang Min Lee1, Hanjoon Kim1, Jeseung Yeon1, Minho Kim1, Changjae Park1, Byeongwook Bae1, Yojung Cha1,

Wooyoung Choe1, Jonguk Choi1, Younggeun Choi1, Ki Jin Han2, Seokha Hwang1, Kiseok Jang1, Jaewoo Jeon1, Hyunmin Jeong1,

Yeonsu Jung1, Hyewon Kim1, Sewon Kim1, Suhyung Kim1, Won Kim1, Yongseung Kim1, Youngsik Kim1, Hyukdong Kwon1, Jeong Ki

Lee1, Juyun Lee1, Kyungjae Lee1, Seokho Lee1, Minwoo Noh1, Junyoung Park1, Jimin Seo1, June Paik1

Paper Affiliation: 1FuriosaAI, Seoul, Korea, 2Dongguk University, Seoul, Korea

CONTEXT AND STATE OF THE ART

As large language models become more prevalent and complex, there's a growing need for specialized hardware that can

efficiently handle these workloads.

With data centers increasingly concerned about energy consumption and sustainability, RNGD addresses the need for high-

performance AI acceleration with significantly lower power requirements.

TECHNICAL HIGHLIGHTS

FuriosaAI presents their RNGD AI accelerator optimized for large language models (LLMs) with superior performance

efficiency compared to GPUs for LLM inference tasks.

Uses tensor contraction as a primitive instead of matrix multiplication, enabling massive parallelism and time-axis

pipelining.

Implements a flexible architecture with Processing Elements (PEs) that can be split into smaller compute units called

slices, allowing for adaptable configurations and efficient data reuse.

Demonstrates superior performance efficiency, with the chip delivering 2.7x better performance per watt than the H100

and 4.1x better than L40s GPUs for LLM inference tasks.

APPLICATIONS AND ECONOMIC IMPACT

LLMs are revolutionizing businesses by enhancing decisions, customer interactions, and industry-wide innovation. As these

models evolve, the demand for specialized AI accelerators is growing to efficiently handle increasingly sophisticated LLM

workloads.

By 2027 AI servers could use between 85 and 134 terawatt hours (TWh) annually. There is an increasing demand for companies

to prioritize electricity consumption in the development of the next generation of AI hardware and software. The significantly

lower power consumption of RNGD could lead to substantial cost savings for data centers.

Session 16 Highlights: Highlighted Chip Releases:

Digital and Machine Learning Processors

[16.3] An On-Device Generative AI Focused Neural Processing Unit in 4nm Flagship

Mobile SoC with Fan-Out Wafer-Level Package

Paper Authors: Jun-Seok Park, Taehee Lee, Heonsoo Lee, Changsoo Park, Youngsang Cho, Mookyung Kang, Heeseok Lee, Jinwon

Kang, Taeho Jeon, Dongwoo Lee, Yesung Kang, Kyungmok Kum, Geunwon Lee, Hongki Lee, Minkyu Kim, Suknam Kwon, Sung-

beom Park, Dongkeun Kim, Chulmin Jo, HyukJun Chung, Ilryoung Kim , Jongyoul Lee

Paper Affiliation: Samsung Electronics, Hwaseong, Korea

CONTEXT AND STATE OF THE ART

NPUs offer superior energy efficiency compared to both CPUs and GPUs. This makes them particularly valuable for edge

computing environments where power constraints are critical.

FOWLP (Fan-Out Wafer-Level Packaging) significantly improves thermal characteristics, and reduces thermal resistance. This

thermal improvement allows for increased clock frequencies without an increase in power, resulting in better AI performance.

TECHNICAL HIGHLIGHTS

Samsung presents their 4nm Exynos™ 2400 NPU with a heterogeneous architecture, supporting

various neural networks and increasing the maximum clock frequency by 30% while

consuming the same power due to FOWLP.

Uses a heterogeneous NPU architecture with vector engines and two types of tensor engines (8K and 512 MACs). It

supports diverse neural networks, including memory-intensive LLMs and compute-intensive LVMs.

Has memory hierarchy with Q-caches and prefetching maximizes data reuse and reduces latency, particularly effective

for CNNs.

Implements a skewness curve-based tiling method that optimizes matrix/tensor tiling for maximum data reuse within

memory constraints.

APPLICATIONS AND ECONOMIC IMPACT

The semiconductor market is projected to reach around $400 billion by 2030, with AI semiconductors expected to make up over

30% of this market. Traditional CPUs struggle to handle AI workloads, while GPUs are large, expensive, and energy-intensive.

This has led to the development of NPUs as a more specialized and efficient solution for AI processing.

Session 16 Highlights: Highlighted Chip Releases:

Digital and Machine Learning Processors

[16.4] SambaNova SN40L: A 5nm 2.5D Dataflow Accelerator With Three Memory Tiers

for Trillion Parameter AI

Paper Authors: Raghu Prabhakar, Junwei Zhou, Darshan Gandhi, Youngmoon Choi, Mahmood Khayatzadeh, Kyunglok Kim, Uma

Durairajan, Jeongha Park, Satyajit Sarkar, Jinuk Luke Shin

Paper Affiliation: SambaNova Systems, Palo Alto, CA

CONTEXT AND STATE OF THE ART

LLMs and other AI models have been growing exponentially in size, requiring more computational power and memory capacity. As

these AI models become larger and more complex, there's a growing need for more energy-efficient hardware solutions.

Modern AI applications often require the ability to run multiple models or switch between models quickly, necessitating flexible

hardware architectures.

TECHNICAL HIGHLIGHTS

SambaNova presents the SN40L, a 5nm AI accelerator with a flexible on-chip dataflow architecture offering high

performance and energy efficiency for modern AI training and inference applications.

Utilizes a dual-die design with 102 billion transistors per socket, delivering 640 BF16 TFLOPs and 520 MB on-chip

SRAMs.

Implements a three-tier memory hierarchy with on-chip SRAM, on-package High Bandwidth Memory (HBM), and off-

package DRAM, enabling efficient handling of large AI models.

Includes a Power Estimation Unit (PEU) that dynamically adjusts operating conditions within microseconds, optimizing

performance while staying within power limits.

Can host up to 5 trillion parameter models in an 8-socket system.

APPLICATIONS AND ECONOMIC IMPACT

LLMs with a higher number of parameters can capture more intricate patterns and nuances in data, leading to improved

performance. Additionally, as AI applications evolve, the demand for larger models will grow. Accelerators that support many

parameters enable improved performance, scalability, and future-proofing.

AI accelerators that dynamically adjust operating conditions to optimize performance under varying workload demands are crucial

for managing the computational requirements of different phases in LLM processing, such as the compute-intensive prompt

phase and the lighter generation phase.

Session 17 Overview: Hardware Security

Security Subcommittee

Session Chair: Takeshi Sugawara, University of Electro-Communications, Tokyo, Japan

Session Co-Chair: Leibo Liu, Tsinghua University, Beijing, China

The session covers hardware security solutions across design hierarchies, from processor architectures to circuits. The first paper

introduces a sensor for detecting side-channel eavesdropping attacks using the silicon debug tool, the laser voltage prober. The next two

papers present processor architectures designed to accelerate fully homomorphic encryption (FHE) and FHE-based cryptographic

schemes. These are followed by two papers that discuss physically-unclonable-function (PUF) circuits with extremely low error rates.

The final paper highlights a clock generator designed to detect fault injection attacks.

● In Paper 17.1, National University of Singapore demonstrates on-chip detection for laser voltage probing attacks without using an explicit sensor

through transistor leakage shifted by a laser. The proposed detection principle, implemented in a 28nm process, exhibits an overhead of

approximately 0% in power, 1.23% in performance, and 4.35% in area.

● In Paper 17.2, Tsinghua University describes a reconfigurable multi-scheme processor for FHE, enabling energy-efficient execution of the BGV,

BFV and CKKS schemes. The processor, fabricated as a 5.4mm2 chip using a 28nm process, achieves 4.05μJ/encryption at 0.7V for client-side

operations and 20.92kHMul/s at 1.0V for server-side evaluations.

● In Paper 17.3, National Taiwan University introduces a processor for secure multi-party computation using the multi-key (MK) CKKS scheme. The

chip, fabricated using a 40nm process, reaches a maximum throughput of 6.72GOPS with 221mW power consumption at 210MHz from a 1.3V

supply.

● In Paper 17.4, Samsung describes a PUF circuit, implemented in both 3nm GAA and 8nm FinFet technologies. A Vth tilting technique for screening

of unstable selections is utilized to achieve a key error rate (KER) below 5.5E-20.

● In Paper 17.5, University of Ulm presents an eye-opening arbiter (EOA) PUF circuit, implemented in 28nm technology featuring an auto-error

detection, which achieves a bit-error rate of 2E-8 over aging, temperature and supply voltage variation.

● In Paper 17.6, Nvidia presents a compact self-calibrating RC oscillator for detecting clock attacks. The proposed 0-trim self- calibrating oscillator

in 3nm FinFET technology, provides a stable 100MHz clock with 11.6ps period jitter and frequency error of ±0.26%.

Session 17 Highlights: Hardware Security

[17.1] Sensor-Less Laser Voltage Probing Attack Detection via Run-Time Leakage Shift

Monitoring with 4.35% Area Overhead

[17.2] A 28nm 4.05



J/Encryption 8.72kHMul/s Reconfigurable Multi-Scheme Fully

Homomorphic Encryption Processor for Encrypted Client-Server Computing

Paper 17.1 Authors: Hui Zhang1, Longyang Lin2, Dingyi Xiong1, Massimo Bruno Alioto1

Paper 17.1 Affiliation: 1National University of Singapore, Singapore, Singapore, 2Southern University of Science and Technology,

Shenzhen, China

Paper 17.2 Authors: Sijia Lu1, Wenping Zhu1, Bohan Yang1, Jiajun Yang1, Tongwei Dai1, Chen Chen1, Xiangdong Han1, Jinjiang

Yang2, Hanning Wang1, Min Zhu3, Shaojun Wei1, Aoyang Zhang1, Leibo Liu1

Paper 17.2 Affiliation: 1Tsinghua University, Beijing, China, 2Wuxi Research Institute of Applied Technologies, Tsinghua University,

Wuxi, China, 3Wuxi Micro Innovation Integrated Circuit Design Company Ltd., Wuxi, China

Subcommittee Chair: Ingrid Verbauwhede, KU Leuven, Leuven, Belgium, Security

CONTEXT AND STATE OF THE ART

● Resource-constrained embedded systems are vulnerable to sophisticated physical attacks such as laser voltage probing and

require the development of low-overhead on-chip solutions for attack detection and mitigation.

● Fully homomorphic encryption (FHE) provides a cryptographic framework for privacy-preserving computation, but its compute

and memory bottlenecks necessitate the design of efficient and flexible hardware accelerators.

TECHNICAL HIGHLIGHTS

● National University of Singapore introduces an on-chip sensor-less laser voltage probing attack detector circuit in

28nm CMOS achieving significantly reduced power and area overhead compared to state-of-the-art.

o Leakage shifts in sleep transistors in the presence of a laser are used for attack detection, while preserving standard-

cell design and achieving low overhead of approximately 0% in power, 1.23% in performance and 4.35% in area.

● Tsinghua University introduces a multi-scheme FHE processor in 28nm CMOS for client-side encryption / decryption

and server-side encrypted computation with improved energy efficiency compared to state-of-the-art.

o A reconfigurable array of processing elements is used for execution of BGV, BFV and CKKS levelled FHE schemes,

achieving 4.05J/encryption for client-side operations and 20.92kHMul/s for server-side evaluations.

APPLICATIONS AND ECONOMIC IMPACT

● Advanced circuit techniques for low-cost on-chip laser voltage probing attack detection without the use of explicit sensors

enable low-power solutions for improved resilience against sophisticated physical attacks.

● Energy-efficient hardware acceleration of multiple FHE schemes enables computation on encrypted data, e.g., machine-

learning tasks, suitable for privacy-sensitive applications such as big data, finance and healthcare.

● Hardware security innovations are demonstrated across design hierarchies through circuit and architecture optimizations.

Session 18 Overview:

Noise-Shaping and SAR-Based ADCs

Data Converter Subcommittee

Session Chair: Xiyuan Tang, Peking University, Beijing, China

Session Co-Chair: Pieter Harpe, Eindhoven University of Technology, Eindhoven, The Netherlands

Session Chair: Ying-Zu Lin*, Mediatek, Hsinchu, Taiwan

This session presents eight ADCs where noise-shaping techniques and SAR/Pipe-SAR structures are well exploited. These papers are

sorted in ascending order of speed. With <1MHz bandwidth, the first 4 ADCs achieve over 90dB SNDR with more than 180dB Schreier

FoM (FoMS). Ranging from 2.5 to 80MHz bandwidth, the following 3 ADCs have FoMs >170dB. The last paper is a cryo-CMOS CI-SAR

ADC with a 4fF input capacitance.

• In Paper 18.1, UESTC presents a fully dynamic NS-SAR ADC achieving 120.6dB SNDR and 132.5dB SFDR in 1kHz BW, resulting in 189.2dB

FoMS. The performance is enabled by predict and skip, hybrid mismatch shaping, system-level chopping, and FIA.

• In Paper 18.2, Peking University introduces a Gain-Embedded Bootstrapped Sampler input stage adopted in zoom pseudo-pseudo-differential

ADC. This ADC achieves 99.6dB SNDR over 4kHz bandwidth and a FoMS of 184.8dB.

• In Paper 18.3, Peking University proposes a calibration-free pipelined-SAR ADC with cross-stage gain-mismatch error shaping. Inherently

preserving residue voltage for noise-shaping, the prototype achieves 180.4dB FoMS.

• In Paper 18.4, Peking University presents an incremental noise-shaping pipeline ADC with single amplification based kT/C noise cancellation.

This ADC realizes an SNDR of 92.5dB at 1.6MS/s and 467.3uW power, leading to an FoMS of 184.8dB.

• In Paper 18.5, Peking University introduces a 3rd-order noise-shaping SAR ADC with a continuous-time correlated level-shifting embedded ADC

driver. This ADC achieves 82.9dB SNDR at 40MS/s, with an FoMS of 176.3dB.

• In Paper 18.6, Peking University shows an easy-drive 16MS/s pipelined-SAR ADC with a split coarse-fine input buffer sampling scheme. This

ADC achieves 79.4dB SNDR and an FoMS of 176.3dB.

• In Paper 18.7, Tsinghua University presents a filter-embedded pipe-SAR ADC with a progressive conversion scheme and a dynamic floating-

charge transferrer. This ADC achieves 70.1dB over 80MHz BW with 172.2dB FoMS; it provides >30dB out-of-band suppression.

• In Paper 18.8, KU Leuven reveals a 7b cryo-CMOS charge-injection SAR ADC with a low input capacitance of 4fF. It achieves an SNDR of 40.7dB

at a temperature of 6.5K while consuming 2.89mW at a sampling rate of 800MS/s.

*1981-2024

Session 18 Highlights:

Noise-Shaping and SAR-Based ADCs

[18.3] A 93.3dB SNDR, 180.4dB FoMS Calibration-Free Noise-Shaping Pipelined-SAR

ADC with Cross-Stage Gain-Mismatch Error-Shaping Technique and Negative-R-

Assisted Residue Integrator

[18.7] A 70dB-SNDR 80MHz-BW Filter-Embedded Pipe-SAR ADC Achieving 172dB FoMS

with Progressive Conversion and Floating Charge-Transfer Amplifier

Paper 18.3 Authors: Jihang Gao, Yaohui Luan, Siyuan Ye, Xinhang Xu, Zhuoyi Chen, Jiajia Cui, Ru Huang, Linxiao Shen

Paper 18.3 Affiliation: Peking University, Beijing, China

Paper 18.7 Authors: Siyu Huang, Zhishuai Zhang, Xiyu He, Mingyang Gu, Yunsong Tao, Yi Zhong, Nan Sun, Lu Jie

Paper 18.7 Affiliation: Tsinghua University, Beijing, China

Subcommittee Chair: Jan Westra, Broadcom, Bunnik, The Netherlands, Data Converters

CONTEXT AND STATE OF THE ART

● Non-idealities such as gain error, DAC mismatch, and clocking skew pose a limit to prevent ADCs from achieving high

linearity. On the other hand, thermal and kT/C noise induce a tradeoff between SNR and power/area.

● High-precision ADCs usually require complicated architecture and circuit techniques to enhance SNR and linearity. The

consequent overhead is various foreground/background calibrations.

● With the advancement of ADC systems, the peripheral circuits including filters and buffers, become increasingly important for

overall system performance/cost.

TECHNICAL HIGHLIGHTS

● Peking University introduces a calibration-free pipeline-SAR ADC with hardware-efficient cross-stage gain-mismatch

error shaping.

o The calibration-free prototype achieves 93.3dB SNDR, 180.4dB SNDR FoMS, which is the highest among previous

calibration-free ADCs with gain/mismatch error suppression techniques. It also has the largest gain-error tolerance

range.

● Tsinghua University presents a filter-embedded pipeline-SAR ADC with progressive conversion and floating charge-

transfer amplifier.

o The progressive conversion mitigates the speed penalty of the filtering operation, and a dynamic floating charge

transfer realizes high-speed, high-efficiency residue amplification. It realizes 70.1dB SNDR over 80MHz BW, and

provides >30dB out-of-band suppression for full-scale blockers.

APPLICATIONS AND ECONOMIC IMPACT

● Hardware/time-efficient calibration or calibration-free high-precision noise-shaping ADCs will be attractive for industry mass

production.

● kT/C noise poses a fundamental limitation to high SNR design. kT/C cancellation not only decreases the power consumption

of the ADC, but also the reduced input capacitance is power friendly to its input driver.

● ADCs with embedded filtering capability can ease system integration, making them suitable for compact, cost-sensitive

industries such as consumer electronics and IoT.

Session 19 Overview:

Frequency Synthesizers and Series-Resonance VCOs

RF Subcommittee

Session Chair: Dmytro Cherniak, Infineon Technologies, Austria

Session Co-Chair: Wei Deng, Tsinghua University, Beijing, China

Frequency synthesizers are essential components of any wireless communication systems and modern SerDes applications. The session

features elven papers on ultra-low-noise fractional-N PLLs, LO systems, and series-resonance VCOs. The final paper presents a PLL

with 15.8fs jitter.

• In Paper 19.1, the University of Macau presents a 28nm-CMOS 5.5GHz fractional-N cascaded RO-based digital PLL with voltage-domain

feedforward noise cancellation achieving a 290.8fs jitter with <10% variation over PVT.

• In Paper 19.2, the University of Tokyo presents a fractional-N cascaded PLL that uses two MMDs with a shared DSM for quantization noise

cancellation. The PLL, implemented in 65nm CMOS, achieves 96fsrms jitter, a -70.6dBc worst-case fractional spur, and a -247.1dB FoM near

5.2GHz.

• In Paper 19.3, MediaTek presents a 9.5GHz fractional-N PLL with a multipath feedback technique and implemented in 22nm CMOS. The PLL

demonstrates 34fs jitter, -64.8dBc fractional spur, and -255.5dB FoM.

• In Paper 19.4, University College Dublin presents an 8.1-to-9.9GHz single-core oscillator, which employs pseudo-series-resonance topology. This

oscillator, implemented in 28nm CMOS, achieves -128dBc/Hz phase noise at a 1MHz offset.

• In Paper 19.5, the South China University of Technology presents a 7.7-to-9.1GHz differential series-resonance CMOS VCO with a pole-

convergence technique. The VCO achieves -152.9dBc/Hz phase noise at a 10MHz offset when implemented in 65nm CMOS.

• In Paper 19.6, KAIST describes a fast-hopping LO that uses an SSB-mixer-based structure. Implemented in 28nm CMOS, the LO achieves sub-

nanosecond settling, −56dBc worst-case spur, and 60.3fs rms jitter.

• In Paper 19.7, the University of Macau shows a 27GHz fractional-N subsampling PLL achieving 57.9fsrms jitter, -249.7dB FoM, and 1.98μs locking

time in a 28nm CMOS technology.

• In Paper 19.8, the Chinese Academy of Sciences presents a 0.65V 10.4-to-11.8GHz low-voltage low-jitter fractional-N sampling PLL that uses a

hybrid cascaded digital pre-distortion technique, low-jitter low-voltage digital-to-time converter, and low-voltage sampling phase detector.

• In Paper 19.9, Tsinghua University describes a FMCW PLL with an overlap-snapshot posterior-segmentation DPD technique and a cycle-slipping-

immune TDC for ultra-high-slope chirp generation. It achieves 0.043% rms frequency error under 3.4GHz/μs slope and 5.32GHz bandwidth. The

proposed DPD converges within 5 chirp cycles.

• In Paper 19.10, ETH Zürich proposes a PLL-XO co-design where XO provides a clean reference to the PLL and PLL provides the precise injection

timing with a narrow angle for the pulse XO driver. The proposed design achieves -255.2dB FoMJ including all the power and noise of the frequency

synthesis chain.

• In Paper 19.11, Xidian University shows a 13GHz charge-pump PLL with a low-noise resistive-discharge time-amplifying phase-frequency detector

and a series-resonance voltage-controlled oscillator achieving 15.8fs jitter and a -98.5dBc reference spur.

Session 19 Highlights:

Frequency Synthesizers and Series-Resonance VCOs

[19.2] A 96fsrms-Jitter, -70.6dBc-Fractional-Spur Cascaded PLL Employing Two MMDs

with Shared DSM for Quantization Noise Cancellation

[19.4] An 8.1-to-9.9GHz Single-Core Pseudo-Series-Resonance Oscillator Achieving -

128.7dBc/Hz PN at 1MHz

[19.10] A 4.6GHz 63.3fsrms PLL-XO Co-Design Using a Self-Aligned Pulse-Injection

Driver Achieving -255.2dB FoMJ Including the XO Power and Noise

Paper 19.2 Authors: Haoming Zhang, Yuyang Zhu, Masaru Osada, Tetsuya Iizuka

Paper 19.2 Affiliation: University of Tokyo, Tokyo, Japan

Paper 19.4 Authors: Jiawen Chen1, Kai Xu2, Teerachot Siriburanon1, Robert Bogdan Staszewski1

Paper 19.4 Affiliation: 1University College Dublin, Dublin, Ireland, 2King's College London, London, United Kingdom

Paper 19.10 Authors: Can Livanelioglu, Long He, Jiang Gong, Sina Arjmandpour, Taekwang Jang

Paper 19.10 Affiliation: ETH Zürich, Zurich, Switzerland

Subcommittee Chair: Brian Ginsburg, Texas Instruments, Dallas, TX, RF Subcommittee

CONTEXT AND STATE OF THE ART

● Low-phase-noise PLLs are essential for new high-bandwidth wireless communication standards.

● Cascading PLL architecture demonstrates outstanding jitter and spurious performance.

● A new series-resonance VCO topology stands out as a viable way to further reduce the phase noise required for future 6G

wireless communication.

TECHNICAL HIGHLIGHTS

● The University of Tokyo presents a 5.7GHz fractional-N cascaded PLL employing two MMDs with shared DSM for

quantization noise cancellation

o A cascaded fractional-N PLL achieves 96fsrms-Jitter, –70.6dBc fractional spur, and -247.1dB FoM.

● University College Dublin presents an 8.1-to-9.9GHz single-core oscillator with pseudo-series-resonance topology.

o An oscillator achieves -128dBc/Hz phase noise at a 1MHz offset and 189.8-to-191.4dBc/Hz FoM at 1MHz.

● UTH Zunich presents a PLL-XO co-design where XO provides a clean reference to the PLL and PLL provides the

precise injection timing with a narrow angle for the pulse XO driver.

o The PLL achieves 63fs jitter and -255dB FoM with 2.25mW power consumption.

APPLICATIONS AND ECONOMIC IMPACT

● Advanced low-phase-noise PLLs are required for high-bandwidth wireless communication.

● Digital techniques can be applied to reduce calibration time and overall system cost.

● Lower power consumption while preserving performance enables the integration in battery-powered devices.

Session 20 Overview:

Sensors and Actuators for Health and Autonomy

Technology Directions Subcommittee

Session Chair: Milin Zhang, Tsinghua University, China

Session Co-Chair: Kyeongha Kwon, KAIST, Korea

Sensing and actuation technologies are becoming specialized and diverse, with a focus on multi-modality, miniaturization and power

efficiency for innovative applications. The session begins with gas sensing technologies, featuring a multi-modal gas sensor chip

and a nose-on-a-chip with on-chip learning for non-invasive disease diagnosis. The next paper presents a novel RFID-inspired bio-

analyzer for point-of-care diagnostics, followed by a MEMS-free e-nose sensor array. This session then explores biochemical

diagnostic solutions and autonomous/adaptive systems, featuring a wireless electrochemical sensing SoC, a self-propelling

microrobot with sub-GHz energy harvesting, a biochemical chip with precise temperature control, a battery-free silicon platform for

adaptive sensing, and an autonomous microactuator system with flying solid-state batteries. A BodyID system featuring a clockless

wake-up receiver and a γ photon spectrometer for precision cancer resection round out the session.

• In Paper 20.1, Osaka University demonstrates a 16-channel multi-modal gas sensor employing membrane-based stress sensors integrated

with readout electronics. The system is demonstrated to measure the concentrations of multiple gasses, while maintaining state-of-the-art

sensitivity, sample rate, and power consumption.

• In Paper 20.2, Tsinghua University presents an E-nose system with on-chip incremental learning for non-invasive diagnosis of diseases. The

chip consumes 67μW/Channel and obtains a 98% accuracy for 10-class gas recognition with one-shot learning.

• In Paper 20.3, University of California at Berkeley, presents a fully wireless CMOS/microfluidics bioanalyzer for point-of-care use. It integrates

amperometric, pH, and temperature sensors, which, combined with PDMS vacuum-actuated microfluidics, enables immunoassays to be

demonstrated.

• In Paper 20.4, ETH Zurich demonstrates an E-nose with 4096 capacitance-gas interfacing pixels with sigma-delta ADCs and in-pixel heaters

for thermodynamic modulation and reusability. Metal-Organic Framework (MOF) materials, provide selectivity and achieve a LoD of 5 ppm.

• In Paper 20.5, Tsinghua University presents a 4.3×4.1×4 mm3 16-channel wireless electrochemical sensor with integrated thin-film Organic

Electrochemical Transistor, for 0.1pM LoD. Ultrasonic power and communications employ an active rectifier and backscatter OOK

modulation.

• In Paper 20.6, Shanghai Jiao Tong University presents a self-propelling microrobot that moves through ionic fluids at mm/s. It employs on-

chip traveling wave electroosmosis pump for actuation and a sub-GHz parity-time symmetry energy harvesting method for enhanced energy

transfer efficiency during movement.

• In Paper 20.7, ADI demonstrates a ASIC comprising 384 thermally-isolated sites with individual temperature control. Each site can be

controlled with +/- 0.12°C precision enabling double-strand DNA purification via selective melting.

• In Paper 20.8, University of Macau proposes a distributed, adaptive, and event-driven sensing framework. A battery-less chip compresses

the sensor data to reduce the wireless transmission payload, limiting the power to 94.8nW.

• In Paper 20.9, University of California, San Diego and CEA-Léti presents a high-voltage microactuator drive system without extra passive

components. It generates up to 56.1V output enabling over 50 hours continuously working at 20Hz with a total weight of 1.8g.

• In Paper 20.10, University of California at Berkeley presents a 200GHz 200-pixel 2-D near-field imager employing active split-ring resonators

(ASRRs) with Q-boosting for biomedical applications. The system achieves 46µm×54µm pixel dimensions and demonstrates tissue imaging

with 27dB pixel-to-pixel isolation.

• In Paper 20.11, Purdue University demonstrates an asynchronous, ultra-low-power wake-up receiver and a crystal-less clock recovery circuit

designed for on-body communication. The reported system achieves a startup time of less than 80µs while consuming under 425.2nW.

• In Paper 20.12, University of California at Berkeley demonstrates a configurable Gamma-photon spectrometer for precision radio guided

cancer resection. The system measures the voltage decay on a small reverse biased diode to resolve the energy with sub-keV resolution

and 1.315 MeV dynamic range.

Session 20 Highlights:

Sensors and Actuators for Health and Autonomy

[20.3] An RFID-inspired One-step Packaged Multi-mode Bio-analyzer with Vacuum

Microfluidics for Point-of-Care Diagnostics

[20.7] A 384-site chip platform for biochemical applications with individual site

precision temperature control

Paper 20.3 Authors: Yan-Ting Hsiao, Ya-Chen Tsai, Wei Foo, Hung-Yu Hou, Yun-Chun Su, Yueting Lily Li, Jun-Chau Chien

Paper 20.3 Affiliation: University of California at Berkeley, Berkeley, CA, National Taiwan University, Taipei City, Taiwan

Paper 20.7 Authors: Michael Coln, Qingdong Meng, Viorel Bucur, Ramji Lakshmanan, Aditya Yadav, David Lloyd, Nicol Ferri, Mark

Bignell, Daniele Di Nuzzo, Phillip Nadeau, Matthew Hayes, Roman Trogan

Paper 20.7 Affiliation: Analog Devices, Boston, MA, Analog Devices, Limerick, Ireland, Analog Devices, San Jose, CA, Evonetix, Ltd,

Cambridge, United Kingdom

Subcommittee Chair: Ali Hajimiri, California Institute of Technology, Pasadena, CA, Technology Directions

CONTEXT AND STATE OF THE ART

● Integrating millimeter-sized CMOS integrated circuits with microfluidics presents a promising solution for multiplexed bio-

marker detection with low cost and reduced size.

● The integration of microfluidics for more advanced functions presents a significant challenge in terms of packaging and cost.

● Effective thermal isolation, localized heat delivery, and precise local temperature sensing are key to efficient temperature

control with multiple independent channels.

TECHNICAL HIGHLIGHTS

● University of California at Berkeley introduces a near-field inductively powered CMOS bio-analyzer that seamlessly

integrates with PDMS microfluidics, eliminating the need for electrical connections.

o The utilization of naturally-formed Al2O3 oxide as a pH sensing membrane eliminates the need for any post-CMOS

processing.

● Analog Devices demonstrates a chip platform with thermally controlled biochemistry

o The integration of post-processed heated reaction sites on an ASIC enables the individual programming of 384 sites

within the temperature range of 25-125°C, while maintaining minimal thermal crosstalk (< +/- 0.12°C).

APPLICATIONS AND ECONOMIC IMPACT

● Millimeter-sized bioanalyzer for molecular diagnostics enables point-of-care (POC) detection directly at or near the patient’s

location.

● Temperature control with greater throughput enables precise thermally-controlled for biochemical applications.

Session 21 Overview: Compute & USB Power

Power Management Subcommittee

Session Chair: John Pigott, NXP Semiconductors, Chandler, Arizona, USA

Session Co-Chair: Rinkle Jain, Nvidia, Portland Oregon, USA

The overarching theme of this session is design innovation in power converter topologies and control yielding simple and robust circuits

that broadly achieves the following: 1) inherently reducing passive needs; 2) effective volume utilization of existing passives through

continuous conduction; and 3) auto current and voltage balancing across the many phases or passives within.

The session has excellent representation of innovations in the compute and USB area from both industry as well as university researchers

spanning from basic buck converters to hybrid yet inductive converters as well as a resonant SC converter and a digital LDO.

Overall, we have five papers covering 12V to 1V converters with currents over 4.5A.

• In Paper 21.1 from Zhejiang University, Hangzhou, China describes a 12-to-1V two-stage converter with regulated resonant switched-capacitor

regulators using 65nm CMOS and reaching 89.3% efficiency.

• In Paper 21.2 from Samsung Electronics, Hwaseong, Korea introduces a dual-input bidirectional 3-Level battery charger with VCF balancing and

a wide input from 4.65 to 12 V for mobile applications using 130nm BCD and reaching 96.8% efficiency.

• In Paper 21.3 from University of Macau covers a Multi-Phase Hybrid Converter with Inherently Auto-Balanced ILs using a 180nm BCD process

which reaches 92.9% efficiency and can deliver 6A.

• In Paper 21.4 from Korea University, Seoul, Korea covers a half-inductor-current hybrid bidirectional converter performing USB to 2-cell

bidirectional power transfer reaching 97.4% efficiency using a 180nm technology and only requiring 5V transistors.

• In Paper 21.5 from Intel, Santa Clara, California, USA presents a 60MHz integrated multi-phase regulator which features an autonomous mode

transition from hard-switching to soft-switching to discontinuous conduction mode using a 3nm FinFET technology.

• In Paper 21.6 from Sogang University, Seoul, Korea describes a 2A analog distribution LDO for a SoC in a 28nm technology with excellent noise

immunity in an SoC application.

• In Paper 21.7, researchers from University of California San Diego, California, USA describe a merged hybrid and multi-phase implementation of

a 6-phase triple-step-down DC-DC converter which can reach 91% efficiency and deliver 7A using a 180nm process at a current density of

868A/cm³.

• In Paper 21.8 from University of Macau, Macao, China presents a scalable hybrid DC-DC converter with a power ring for current balance and a

back ring for phase shedding, named as ‘HOOP’. Implemented in 180nm BCD, this converter can reach 90.2% efficiency and deliver up to 16A.

• In Paper 21.9, from University of Macau, Macao, China presents a 20MHz & 1MHz dual-loop multi-inductor hybrid DC-DC converter with non-

uniform current allocating optimizing both up and down transient performance as low as 6.3% using a 180nm BCD technology.

Session 21 Highlights: Compute & USB Power

[21.9] A 20MHz & 1MHz Dual-Loop Non-Uniform-Multi-Inductor Hybrid DC-DC Converter

with Specified Inductor Current Allocation and Fast Transient Response

Paper 21.9 Authors: Junwei Huang1, Xiangyu Mao1, Zhiguo Tong1, Zhewen Yu1, Wenjie Yang1, Chi-Seng Lam1, Rui P. Martins1, Yan

Lu1, 2

Paper 21.9 Affiliation: 1University of Macau, Macao, China, 2Tsinghua University, Beijing, China

Subcommittee Chair: John Pigott, NXP Semiconductors, Chandler, Arizona, USA

CONTEXT AND STATE OF THE ART

● Efficient high-performance power conversion to supply modern loads from a 12V source is critical to enable high-end

computing. Higher efficiency and fast transient response reduces volumes and costs. These features are critical for all

compute loads, including servers, portable computers, and automotive applications.

● Innovations in DC-DC architectures, including novel hybrid topologies with hybrid power delivery and multiple regulation loops

continue to advance the state-of-the-art in this field.

TECHNICAL HIGHLIGHTS

● The University of Macau introduces a 20MHz & 1MHz dual-loop non-uniform-multi-inductor hybrid DC-DC converter

with specified inductor current allocation and fast transient response.

o Two hybrid power stages combine to deliver 4.5A by controlling the allocation of power from each stage. While a

1MHz stage delivers most of the power, a 20MHz stage performs load regulation and controls fast transient response

using optimized core devices with a 180nm technology.

APPLICATIONS AND ECONOMIC IMPACT

● Automotive, server and portable compute applications continue to require high quality power delivery at high efficiency and

low volume.

● Hybrid converter topologies allow partitioning of the regulation amongst separate circuits optimized for efficiency and for fast

transient response, while combining and controlling the outputs effectively.

Session 22 Overview: Memory Interface

Memory Subcommittee

Session Chair: Dongkyun Kim, SK Hynix, Korea

Session Co-Chair: Hidehiro Shiga, KIOXIA, Japan

SK hynix and Seoul National University present PAM3 GDDR7 interfaces that can achieve 42Gb/s: SK hynix describes a DFE which

provides feedback within 1UI, and Seoul National University describes a clock-referred transceiver technology. Hanyang University and

Korea University introduce PAM4 interfaces: Hanyang University introduces a 32-50Gb/s transmitter that includes a ZQ-based FFE, and

Korea University introduces a 32Gb/s single-ended receiver that includes a capacitive-feedback equalizer. In addition, Seoul National

University presents a low-power and low-jitter quadrature-clock generator for high-bandwidth HBMs.

• In Paper 22.1, Seoul National University (SNU) presents a low-power (0.275pJ/b) high-speed (42Gb/s/pin) single-ended PAM3 transceiver that

achieves <10-12 BER with 200mVPP injected feed noise.

• In Paper 22.2, Seoul National University introduces a 850μW injection-locked quadrature-clock generator, with 2 - 5GHz jitter filtering and instant

toggling for HBM interfaces.

• In Paper 22.3, SK Hynix presents a 42Gb/s single-ended PAM-3 receiver with a DFE that combines a direct-feedback and loop-unrolled DFE to

minimize area and power overhead for GDDR7 memory interfaces.

• In Paper 22.4, Hanyang University presents a 32 - 50Gb/s/pin single-ended PAM-4 transmitter using a ZQ-based FFE and PAM-4 LSB DBI-DC

encoding.

• In Paper 22.5, Korea University introduces a 32Gb/s/pin single-ended PAM-4 receiver with a delay-less capacitive-feedback equalizer (CFE)

achieving a low-power, 0.3pJ/b, consumption in a 28nm CMOS technology.

Session 22 Highlights: Memory Interface

[22.1] A 0.275pJ/b 42Gb/s/pin Clock-Referenced PAM-3 Supply-Noise-, Reference-

Offset- and Cross-Talk-Tolerant Transceiver for Chiplets and Short-Reach Memory

Interfaces

[22.2] An 850μW 2-to-5GHz Jitter-Filtering and Instant-Toggling Injection-Locked

Quadrature-Clock Generator for Low-Power Clock Distribution in HBM Interfaces

Paper 22.1 Authors: Kahyun Kim1 , Jung-Hun Park2, Ha-Jung Park1, Jia Park1, Jihee Kim1, Woo-Seok Choi1

Paper 22.1 Affiliation: 1Seoul National University, Seoul, Korea, 2University of California at Berkeley, California, USA

Paper 22.2 Authors: Jeongbeom Seo*1, Yoonseo Cho*1,2, Yuhwan Shin1,2, Jaehyouk Choi1

Paper 22.2 Affiliation: 1Seoul National University, Seoul, Korea, 2Korea Advanced Institute of Science and Technology, Daejeon,

Korea

Subcommittee Chair: Dongkyun Kim, SK Hynix, Korea, Hidehiro Shiga, KIOXIA, Japan

CONTEXT AND STATE OF THE ART

● A high-speed and low-power single-ended PAM3 transceiver that uses a differentially-weighted driver and a combined-

sampler DFE to reduce power consumption, and a far-end-reflection (FEXT) cancelation scheme with an FS-puller.

● Injection-locked quadrature-clock generator for low-power and low-jitter clock distribution for HBM interfaces.

TECHNICAL HIGHLIGHTS

● Seoul National University (SNU) presents a low-power (0.275pJ/b) high-speed (42Gb/s/pin) single-ended PAM3

transceiver that achieves <10-12 BER with 200mVPP injected feed noise.

o Forwarded clocks act as reference voltages for supply-noise and reference-offset tolerance. A differentially-

weighted-PAM3 driver and combined-sampler DFE are used to improve power-efficiency, while a XTC-combined FS

puller cancels FEXT.

● Seoul National University (SNU) demostrate a 850μW injection-locked quadrature-clock generator (IL-QCG), with 2 -

5GHz jitter filtering and instant toggling for HBM interfaces.

o IL-QCG has the ability to toggle the output clock instantly, without requiring a continuous clock. It suppresses input-

clock jitter, reducing it from 3.02 to 1.31psRMS, and achieves a PSIJ from 5.80 to 1.73psRMS.

APPLICATIONS AND ECONOMIC IMPACT

● Developing DRAM interface technology is an important technology that can satisfy the high-bandwidth and high-energy

efficiency required by AI computing

● In PAM3, the fastest scheme among DRAM interfaces, improvements such as PSIJ, offset, and cross-talk noise in output

circuits can be a major technology to solve memory BW constraints.

● The precise implementation of the quadrature-clock generator is also an essential and important technology that needs to be

improved in for high-speed DRAM data transmission.

Session 23 Overview: AI Accelerators

Digital Architectures and Systems Subcommittee

Session Chair: Soojung Ryu, Seoul National University, Seoul, Republic of Korea

Session Co-Chair: Hugh Mair, MediaTek, Dallas, TX, USA

AI hardware accelerators are driving innovations in modern computing, influencing generative AI, large language models, and 3D point

cloud analysis. The initial papers focus on reducing external memory accesses and enhancing energy efficiency through hardware-aware

algorithms and novel processing units. Later papers highlight the need for energy-efficient solutions for edge devices and scalable AI

systems, showcasing advancements in optimizing AI hardware for cloud and edge applications. These innovations reflect ongoing efforts

to enhance performance and efficiency in AI hardware design.

• In Paper 23.1, Columbia University and Intel present a 16nm FinFET, 10.15mm2 transformer accelerator that reduces external memory accesses

(EMA) and enhances hardware utilization. Algorithmic model compression lowers EMA by up to 31×, while the chip achieves processing times of

68-567ms/token and energy consumption of 0.41-3.95mJ/token.

• In Paper 23.2, The Hong Kong University of Science and Technology presents a 28nm 13.93mm2 CNN-Transformer accelerator for semantic

segmentation, achieving 3.86-to-10.91× energy reduction over previous designs. It features a hybrid attention unit, layer-fusion scheduler, and

cascaded feature-map pruner, with peak energy efficiency of 52.90TOPS/W (INT8).

• In Paper 23.3, KAIST presents a 418mJ/inference few-step diffusion model accelerator for image generation. The 28nm 20.25mm2 accelerator

achieves 3.3-to-6.8× lower energy consumption than prior work on text-to-image and image-to-image benchmarks.

• In Paper 23.4, Peking University and Reconova Technologies present Nebula, a 28nm 1.5mm2 on-chip accelerator for 3D point cloud analysis.

Nebula employs tree-based adaptive partitioning, multi-skipping, and delayed-aggregation, achieving 109.8TOPS/W energy efficiency,

5.6TOPS/mm² area efficiency, and a 5263.2fps in state-of-the-art benchmarks.

• In Paper 23.5, MediaTek presents a 3nm, 0.168mm² neural processing unit for edge device generative AI. Featuring depth-first fusion, H-reuse

caching, and optimized scheduling, delivering 0.63TOPS with 3.74TOPS/mm² area efficiency. Operating at 0.575V and 546MHz, it achieves

12.38TOPS/W energy efficiency and 84.5% utilization for the TAESD model.

• In Paper 23.6, MIT and Chung-Ang University propose a 28nm 12.96mm2 generative AI processor featuring: 1) MEGA.mini architecture for 5.1×

efficiency, 2) an output synchronizer 3) a cross-shaped memory reducing power by 34.7%, and 4) a unified tensor streaming core. It delivers 8.1×

higher efficiency in non-diffusion and 4.1× in diffusion models compared to conventional processors.

• In Paper 23.7, KAIST presents BROCA, a 28nm 20.25mm2 mobile social agent SoC delivering 3.3× speedup and 1.8× lower energy consumption.

Key features include: 1) ABTU with ACER for energy-efficient response generation, 2) ACBU to reduce vocoder power, and 3) LTMU for low-

latency dialogue context retention.

• In Paper 23.8, Tsinghua University presents a 28nm 3.52mm2 LLM accelerator with: 1) an active-bit-allocate format reducing EMA by 3.87×, 2) a

cluster-aligned mixed-precision processing element lowering power by 4.87×, and 3) a bi-dimensional reformulation enhancing efficiency by

22.3%. It achieves a peak energy efficiency of 88.36TOPS/W in LLaMA-2-7B inference.

• In Paper 23.9, KAIST presents Slim-Llama, a 28nm 20.25mm2 ASIC reducing energy consumption in large language models by minimizing

external memory accesses. Using binary/ternary quantization and a sparsity-aware look-up table, it achieves up to 4.59× better energy efficiency

than previous designs and efficiently runs a billion-parameter Llama model at 4.69mW.

• In Paper 23.10, KAIST presents HuMoniX, a 14nm 12mm2 real-time text-to-motion processor using heterogeneous engine clusters, achieving

57.3fps and 12.8TFLOPS/W. It reduces computation with output sparsity and data compaction, and lowers latency by selectively building meshes

based on inter-frame similarity, supporting mixed-precision data for improved efficiency.

Session 23 Highlights: AI Accelerators

[23.1] T-REX: A 68-567



s/Token, 0.41-3.95



J/Token Transformer Accelerator with

Reduced External Memory Access and Enhanced Hardware Utilization in 16nm FinFET

[23.5] MAE: A 3nm 0.168mm2 576MAC Mini Autoencoder with Line-Based Depth-First

Scheduling for Generative AI in Vision on Edge Devices

[23.6] MEGA.mini: A Universal Generative AI Processor with a New Big/Little Core

Architecture for NPU

[23.8] An 88.36TOPS/W Bit-Level-Weight-Compressed Large Language Model

Accelerator with Cluster-Aligned INT-FP-GEMM and Bi-Dimensional Workflow

Reformulation

Paper 23.1 Authors: Seunghyun Moon1, Mao Li1, Gregory K. Chen2, Phil C Knag2, Ram Krishnamurthy2, Mingoo Seok1

Paper 23.1 Affiliation: 1Columbia University, New York, NY, 2Intel, Hillsboro, OR

Paper 23.5 Authors: Shih-Wei Hsieh, Chia-Hung Yuan, Ming-Hung Lin, Ping-Yuan Tsai, You-Yu Nian, Chia-Yuan Cheng, Hung-

Wei Chih, Po-Han Chiang, Ming-Hsuan Chiang, Yuan-Jung Kuo, Yu-Wei Wu, Yi-Syuan Chen, Po-Heng Chen, Sandy Huang,

Ming-En Shih, Chia-Ping Chen, Abrams Chen, ShenKai Chang, Chih-Ming Wang, Po-Yu Yeh, Jett Liu, Yung-Chang Chang,

Chung-Yi Chen, Chi-Cheng Ju, CH Wang, Kevin Jou

Paper 23.5 Affiliation: MediaTek, HsinChu, Taiwan

Paper 23.6 Authors: Donghyeon Han 1,2, Anantha P. Chandrakasan1

Paper 23.6 Affiliation: 1Massachusetts Institute of Technology, Cambridge, MA, 2Chung-Ang University, Seoul, Korea

Paper 23.8 Authors: Yubin Qin, Yang Wang, Jiachen Wang, Zhiwei Lin, Yushu Zhao, Shaojun Wei, Yang Hu, Shouyi Yin

Paper 23.8 Affiliation: Tsinghua University, Beijing, China

Subcommittee Chair: Rahul Rao, IBM, Bangalore, India

CONTEXT AND STATE OF THE ART

● The existing transformer accelerator designs exhibit large external memory access and low hardware utilization, resulting in

high energy consumption and latency, requiring innovations in transformer accelerator architectures.

● Supporting generative AI for vision on edge devices drives the need for the design of tiny NPUs for running autoencoders with

low external memory access and high temporal utilization, which can cooperate with large NPUs allowing workloads with

different characteristics to be handled efficiently.

● LLM models face challenges of high external memory access, mixed precision GEMM, and non-linear function dependencies,

requiring novel LLM accelerator architecture and circuit techniques to address these issues.

● Accelerating both diffusion and non-diffusion models for generative AI requires a universal generative AI processor with a

novel architecture for efficient computation of both models.

TECHNICAL HIGHLIGHTS

● Columbia University and Intel describe a transformer accelerator designed to reduce external memory accesses

(EMA) and enhance hardware utilization.

o Model compression reduces external memory accesses (EMA) by up to 31, while dynamic batching for variable

input lengths further lowers EMA and boosts utilization by 3.3. A custom register file accessible in row- and column-

directions enhances utilization by an additional 1.2, achieving processing times of 68-567s/token and energy

consumption of 0.41-3.95J/token.

● MediaTek presents a 3nm tiny NPU accelerating auto-encoders and -decoders for generative AI on edge devices.

o The accelerator achieves an energy efficiency of up to 12.3TOPS/W and area efficiency of 3.74TOPS/mm2 on dense

workloads due to optimized scheduling of depth-first fused operators and local H-reuse caching with a direct link

between the convolution and add and concatenation units.

● MIT and Chung-Ang University present a heterogeneous neural processing architecture with large fixed-point and

small floating-point cores.

o The 12.96mm² SoC, fabricated in 28nm CMOS, supports diffusion and non-diffusion models with 1.8-to-8.1 higher

efficiency across five operating modes.

● LLM accelerator from Tsinghua University in 28nm CMOS executes a variety of large-scale generative models.

o External memory access is reduced by 3.87 with an active-bit-allocate format, while a mixed-precision matrix

multiply cuts power consumption by 4.87 and a bi-dimensional reformulation boosts efficiency by 22.3%. The

accelerator achieves peak energy efficiency of 88.36TOPS/W for LLaMA-2-7B inference.

APPLICATIONS AND ECONOMIC IMPACT

● An energy-efficient and low-latency transformer accelerator from Columbia University enables a wide range of AI applications

on the edge, including vision, language and speech.

● A tiny NPU running an autoencoder with high energy efficiency and temporal utilization from MediaTek enables generative AI

for vision on edge devices.

● A universal generative AI processor with a new big-little core architecture from MIT enables efficient acceleration of both

diffusion and non-diffusion models for generative AI applications.

● An LLM accelerator with bit-level weight compression and cluster-aligned INT-FP-GEMM from Tsinghua University

considerably improves the energy efficiency of LLM computations in natural language processing applications.

Session 24 Overview: High-Frequency ADCs

Data Converters Subcommittee

Session Chair: Seung-Tak Ryu, KAIST, Daejeon, Korea

Session Co-Chair: Vanessa Chen, Carnegie Mellon University, Pittsburgh, USA

Despite a long history and well-established design techniques, ADC architectures and circuit technologies for high-speed operation

continue to evolve. The conversion rates presented in this session range from 1 to 72GS/s, demonstrating innovative ADC structures

and circuit techniques aimed at achieving not only high speed but also exceptional energy efficiency. Key advancements introduced

include piecewise-linear nonlinearity calibration, dual-path amplification, sub-quantizer design, a mutually exclusive selection technique

to reduce metastability, wideband input buffers, time-interleaved ADC skew calibration using fractional-delay FIR filters, a hierarchical TI

ADC in 4nm CMOS, passive gain combined with automatic buffer power gating, and more.

• In Paper 24.1, Tsinghua University introduces a gated-LMS-based PWL nonlinearity calibration and dual-path amplification for better accuracy

and efficiency. The prototype ADC achieves 58.8dB SNDR at 3GS/s with 32.5mW power consumption, corresponding to a FoMs of 165dB.

• In Paper 24.2, Xidian University presents a 14b 1GS/s pipelined ADC with SAR sub-quantizer, achieving 68.2dB SNDR, 85.8dB SFDR, and

173.3dB FoMS. A dynamic dead-zone ring amplifier enhances efficiency without bias tuning.

• In Paper 24.3, University of Macau presents a 2× interleaved 2.2GS/s ADC using a gated-CCRO quantizer with improved energy efficiency and

PVT robustness, achieving 45.8dB SNDR, 65.7dB SFDR, and 19.7fJ/step FoMw.

• In Paper 24.4, University of Macau presents a 10b 3GS/s time-domain ADC with mutually exclusive selection technique to reduce metastability

errors. It achieves 47.2dB SNDR in low metastability mode and 49.3dB in split ADC mode at Nyquist input.

• In Paper 24.5, University of Macau presents a 16nm FinFET 72GS/s 9b time-interleaved pipeline SAR ADC with wideband input buffers and a

dual-path bootstrapped switch, achieving 55.3dB SFDR and 41.9dB SNDR at 20GHz input.

• In Paper 24.6, MediaTek presents a 4nm FinFET 12b 16GS/s time-interleaving ADC, consuming 570mW and occupying 0.430mm², achieving -

155dBFS/Hz noise spectral density and reducing spurs lower than -68dBFS at 7.4GHz input with digital calibration.

• In Paper 24.7, Tsinghua University presents an 8b 10GS/s time-interleaved pipelined ADC, achieving a sampling rate of 5GS/s per channel.

Automatic buffer power gating improves efficiency, achieving an FoMW of 22fJ/conv-step.

• In Paper 24.8, Analog Devices presents a 12GS/s 9b 16× time-interleaved SAR ADC in 16nm FinFET, achieving 48.1dB SNDR at 5.33GHz,

consuming 160mW, with one reservoir capacitor per bit and SAR loop feedback innovations.

Session 24 Highlights: High-Frequency ADCs

[24.1] A 3GS/s 12b Pipelined ADC with Gated-LMS-Based Piecewise-Linear Nonlinearity

Calibration

[24.5] A 72GS/s 9b Time-Interleaved Pipeline-SAR ADC Achieving 55.3/49.3dB SFDR at

20GHz/Nyquist Inputs in 16nm FinFET

Paper 24.1 Authors: Mingyang Gu, Yi Zhong, Lu Jie, Nan Sun

Paper 24.1 Affiliation: Tsinghua University, Beijing, China

Paper 24.5 Authors: Yannan Zhang1, Minglei Zhang1, Zehang Wu1, Yan Zhu1, R. P. Martins1,2, Chi-Hang Chan1

Paper 24.5 Affiliation: 1University of Macau, Macau, Macau, 2Instituto Superior Tecnico/University of Lisboa, Lisboa, Portugal

Subcommittee Chair: Jan Westra, Broadcom, Bunnik, The Netherlands, Data Converters

CONTEXT AND STATE OF THE ART

● Compact single-channel high-speed ADCs with low power consumption and high linearity are highly required in many

advanced applications such as 6G communication systems.

● On-chip linearity calibration techniques have become nearly mandatory to alleviate the power demands of analog circuits,

leveraging digital-assisted analog approaches.

● As data rates in wireline applications pursue several hundred Gb/s, there is growing interest in massively interleaved ultrahigh-

speed ADCs in compact chip area.

TECHNICAL HIGHLIGHTS

● Tsinghua University introduces a single-channel 12b 3GS/s pipelined ADC with compact on-chip digital piecewise-

linear nonlinearity calibration and dual path amplification for improved speed and energy efficiency.

o A single-channel 28nm CMOS 12b pipelined ADC with dual-path amplification and compact on-chip digital piecewise-

linear nonlinearity calibration demonstrates 9.4 ENOB at 3GS/s with only 32.5mW and FoMS of 165dB.

● University of Macau presents a 64× 72GS/s TI pipelined-SAR ADC with wideband linearized analog front-end,

achieving the highest SFDR among previously published similar CMOS ADCs and best FoMS.

o A 16nm FinFET 72GS/s 9b time-interleaved pipeline SAR ADC with wideband input buffers and a dual-path

bootstrapped switch achieves 55.3dB SFDR and 41.9dB SNDR at 20GHz input.

APPLICATIONS AND ECONOMIC IMPACT

● High-speed ADCs with resolutions exceeding 12 bits and conversion speeds beyond several GHz will be critical enablers for

advancing next-generation wireless communication standards like 6G.

● Ultrahigh-speed ADCs with good linearity are in high demand for wireline communication applications, as they are vital for

managing advanced modulation schemes and higher oversampling rates.

Session 25 Overview: High Concepts at High Frequencies

Technology Directions Subcommittee

Session Chair: Noriyuki Miura, Osaka University, Suita, Japan

Session Co-Chair: Denis Daly, Apple, Waltham, MA

High frequency and high bandwidth integrated circuits that interact with the electromagnetic spectrum in new ways continue to emerge.

The first two papers present two highly analog/mixed-signal implementations of 3-SAT solvers and cross-correlators that demonstrate

how compute in analog domain offers promise to achieve higher throughput and area efficiencies than traditional digital implementations.

The third paper presents an AI-enabled end-to-end algorithm flow for rapid architecture discovery, circuit topology and parameter

optimization for RFICs. The next two papers explore high frequency measurement. First, Paper 25.4 presents a micromachined

heterogeneously integrated active-probe enabling in-situ measurements from DC to 50GHz. The session concludes with a highly

integrated, low power time-domain channel analyzer with a measurement range from DC to 40GHz.

• In Paper 25.1, the University of Michigan presents an oscillator-based 3-SAT solver that leverages physics-inspired heuristics in a mixed-signal

compute fabric. The prototype implements a continuous-time dynamical system and Dynamic CT Injection (DaCTI) to solve 20/50-variable 3-SAT

problems with improved speed and energy efficiency compared to state-of-the-art solvers.

• In Paper 25.2, Washington University and Oregon State University present a fully-analog 256×256 cross-correlator based on sampling-based

approach and margin propagation paradigm. The IC operates up to 4GS/s input and template data rate and achieves 1000 TOPS/W with high

compute density up to 1.3 TOPS/mm2.

• In Paper 25.3, Princeton University demonstrates an AI-enabled end-to-end algorithm flow for architecture discovery, circuit topology and

parameter optimization for RFICs. The RFICs combine reinforcement learning and inverse methods to synthesize mm-Wave and sub-THz PAs

in 90nm SiGe.

• In Paper 25.4, Sandia National Laboratories presents a high-speed, high-impedance integrated silicon micromachined probe bonded to a DC-to-

50GHz sense amp for measurement applications. The amplifier, realized in a 45nm PD-SOI process, achieves a measured 3dB cutoff frequency

up to 50GHz, with < 90fF input capacitance, a 0dBm IP1dB, a 7.5dBm IIP3 and a 4% THD.

• In Paper 25.5, Peking University presents an integrated channel analyzer implemented in a 28nm CMOS process. The TX utilizes a patterned

step technique with a 4:1 clock gated multiplexer to enhance the SNR of the captured waveforms. The sampler adopts a clock booster to reduce

leakage and minimize the hold errors. The integrated channel analyzer achieves an equivalent sample rate of 1.792THz, 9.1ps rise time,

99.5mW/port power consumption, and a measurement range from DC to 40GHz.

Session 25 Highlights: High Concepts at High Frequencies

[25.1] AI-enabled Design Space Discovery and End-to-end Synthesis for RFICs with

Reinforcement Learning and Inverse Methods Demonstrating mmWave/sub-THz PAs

between 30-120 GHz

Paper 25.1 Authors: Jonathan Zhou1, Emir Ali Karahan1, Sherif Ghozzy1, Zheng Liu1,2, Hossein Jalili1, Kaushik Sengupta1

1Princeton University, Princeton, NJ, 2Kilby Labs, Texas Instruments, Dallas, TX

Subcommittee Chair: Ali Hajimiri, Caltech, Pasadena, CA

CONTEXT AND STATE OF THE ART

● AI-enabled algorithmic flow for architecture discovery, circuit topology and parameter optimization for RFICs, particularly

exploring design spaces beyond human intuition.

● Compared to prior works on optimizing circuit parameters with circuit simulation, this is the first work that demonstrates an

end-to-end RFIC AI-enabled synthesis with both active and passive optimization, and from specifications to layout with

fabricated and measured results.

TECHNICAL HIGHLIGHTS

● Princeton University and Texas Instruments leverage an AI-enabled end-to-end analog RF circuit automatic discovery

and design scheme to demonstrate a mmWave/Sub-THz circuit quickly synthesis RF circuit topology and design

parameters.

o The design time can be reduced between 3 to 6 months for a sub-THz PA designing to few minutes.

APPLICATIONS AND ECONOMIC IMPACT

● Huge design time savings for cost reduction of custom circuits design especially for RFICs.

Session 26 Overview:

Wireless Transmitters and Front-Ends

Wireless Subcommittee

Session Chair: Alireza Zolfaghari, Broadcom, Irvine, US

Session Co-Chair: Giuseppe Gramegna, imec, Leuven, Belgium

This session highlights the latest advancements in the area of wireless transmitters and front-ends. The first three papers introduce RF

transmitters (TX) in different applications spanning from a 24GHz implementation enabling high data rates to a sub-6GHz digital TX,

followed by a low-power IC driven by a neural network engine. The last two papers feature RF front-ends focusing on improved efficiency

and noise figures.

• In Paper 26.1, the University of Southern California presents a 65nm CMOS 24GHz digital TX that uses a multiphase subharmonic (SH) switching

power amplifier and a phase-shifted LO divider for improved SH spur suppression, enabling 3.2Gb/s data rate with 256QAM at -30.8dB EVM and

20.3% PAE.

• In Paper 26.2, Fudan University describes a 0.88mm2 28nm CMOS digital TX with joint digital-analog interpolation to deliver >42dBc replica

rejections for a WiFi 80MHz 256-QAM signal and >33dB dynamic power range. The TX achieves 27.8dBm/27.3dBm/24.8dBm Pout with

30.4%/30.5%/14.9% peak efficiency at 2.6GHz/3.6GHz/5.1GHz, respectively.

• In Paper 26.3, the University of Michigan introduces a 28nm CMOS crystal-less TX IC with the digital modulation and coding implemented with a

neural network, and an analog front-end achieving 41.1% power-efficiency resilience to carrier drift, and 10/4.4dB improvement in receive power

at 0.1% BER compared to uncoded/coded BLE.

• In Paper 26.4, Rice University presents a 0.22mm2 22nm CMOS 24-to-29GHz front-end featuring a two-way asymmetric Doherty PA co-designed

with LNA input matching and T/R switch, that achieves 19.6-to-20.6dBm Pout at 25.0%-to-27.7% PAE and 14.2%-to-19.8% PAE at 9.5dB back-

off, with 3.1 to 4.0dB NF.

• In Paper 26.5, Zhejiang University shows a 65nm CMOS 17.7-to-29.5GHz front-end with a PA load-capacitance-cancellation network T/R switch

in RX mode to achieve 20.2dBm OP1dB with 20.6% PAE and a NF of 3.3 to 5.0dB.

Session 26 Highlights:

Wireless Transmitters and Front-Ends

[26.2] A Wideband Replicas-Rejection Digital Transmitter Using Joint-Digital-Analog

Interpolation and Filtering in 28nm CMOS

Paper 26.2 Authors: Chunxiao Hu*, Jiaxiang Li*, Jie Lin, Hongtao Xu, Yun Yin

Paper 26.2 Affiliation: Fudan University, Shanghai, China

Subcommittee Chair: Chih-Ming Hung, Mediatek, Taiwan

CONTEXT AND STATE OF THE ART

● Improvement on system efficiency, data rate and cost reduction are required by all multiple wireless communications systems

and are driving the development of digital TX (DTX) architectures.

● In a DTX, undesired sampling replicas are close to the signal band, violating emission mask requirements, and requiring

oversampling with external filters.

● The trade-off of oversampling and selectivity of external filtering is made more challenging for a multiband system, which can

be solved using fully separated DTXs.

TECHNICAL HIGHLIGHTS

● Fudan University describes a 0.88mm2 28nm CMOS digital TX with joint digital-analog interpolation achieving

o > 42dBc replicas rejection and > 33dB dynamic power range for WiFi 40MHz 64-QAM and 80MHz 256-QAM signals,

and

o 27.8dBm/27.3dBm/24.8dBm Pout with 30.4%/30.5%/14.9% peak efficiency at 2.6GHz/3.6GHz/5.1GHz, respectively.

APPLICATIONS AND ECONOMIC IMPACT

● The technique allows efficient rejection of digital replicas and simplification of the system with associated cost reduction.

● The wideband technique alleviates the need to split a multiband system into multiple TXs and simplifies the implementation of

future multiband systems.

Session 27 Overview: Sensor Interfaces

Analog Subcommittee

Session Chair: Caspar van Vroonhoven, Analog Devices, Germany

Session Co-Chair: Chinwuba Ezekwe, Robert Bosch, CA

Duty-cycling and dynamic biasing improve power efficiency and scalability of sensor interfaces, while precision circuit techniques enable

new levels of absolute accuracy. The first paper describes a partially duty-cycled MEMS Gyroscope drive control loop that minimizes

standby power dissipation, while the third paper presents a frequency-controlled biasing method that enables a wide range of conversion

rates. The second paper demonstrates a CMOS Hall sensor with state-of-the-art offset performance. Two final papers present

temperature sensors that achieve state-of-the-art performance in both their power efficiency and temperature-sensing inaccuracy.

• In Paper 27.1, Xidian University presents a MEMS Gyroscope with an always-on drive loop, enabled by a burst-mode phase-locked loop. A

partially duty-cycled drive excitation loop maintains the gyroscope in semi-stable oscillation to enable a wake-up time of 2.8ms while limiting the

average standby power consumption to 1.5µW.

• In Paper 27.2, TU Delft presents a voltage-biased CMOS Hall sensor. Octagonal Hall plates are operated in spinning-current mode and read out

using a current-to-digital converter. The sensor achieves a magnetic field offset of 1.0μT (3σ) and a noise floor of 60nT/√Hz.

• In Paper 27.3, Zhejiang University and Vango Technologies present a continuous-time sensor interface. A frequency-controlled current source

enables a gm-C CTΣ∆M to achieve a near-consistent SNDR of ~84dB over a 225× bandwidth range. The sensor interface operates from a sub-

1V supply, has an input-referred noise density of 46nV/rtHz and an FoM of 178.2dB.

• In Paper 27.4, TU Delft presents an NPN-based temperature sensor that uses a noise-optimized charge-balancing scheme and a current-assisted

two-stage amplifier to improve both energy efficiency and accuracy. The sensor achieves an 80fJ∙K2 resolution FoM and an inaccuracy of ±0.1°C

(3σ) from -70°C to 125°C.

• In Paper 27.5, the University of Macau presents a wire-metal-based temperature sensor that uses a fractional-discharge frequency-locked-loop

and a time-domain amplifier to achieve a resolution FoM of 45fJ∙K2 and an inaccuracy of ±0.2°C (3σ) from -40°C to 125°C.

Session 27 Highlights: Scalable Sensor Interface

[27.3] A Sub-1V 14b 5.8nW/Hz BW/Power-Scalable CT Sensor Interface with a

Frequency-Controlled Current Source Achieving a 225× Scalable Range

Paper Authors: Xinjie Wu1, Yuyan Liu2, Xiaopeng Yu1, Nick Nianxiong Tan2, Zhong Tang2

Paper Affiliation: 1Zhejiang University, Hangzhou, China, 2Vango Technologies, Hangzhou, China

Subcommittee Chair: Viola Schaffer, Texas Instruments Deutschland GmbH, Freising, Germany, Analog

CONTEXT AND STATE OF THE ART

• Various IoT and wireless sensor-node applications require sensor interfaces with a variable conversion rate.

• Sensor interfaces with switched-capacitor front-ends such as SARs and discrete-time delta-sigma modulators are bandwidth

scalable but suffer from noise folding. Continuous-time front-ends do not suffer from noise folding but are not scaled as easily

when their internal circuitry incorporates fixed time constants.

TECHNICAL HIGHLIGHTS

• A continuous-time Delta-Sigma (ΔΣ) ADC is biased by a frequency-dependent current source, allowing the converter

to operate over a wide range of frequencies.

o A capacitively-biased-diode technique is used to generate a constant-Gm bias current that is proportional to

frequency but stable over temperature and supply voltage.

o A 2nd-order continuous-time ΔΣ modulator uses Gm-C integrators and an asynchronous multi-bit quantizer to achieve

bandwidth/power-scalable operation at a near-consistent SNDR of 84dB and power scaling of 5.8nW/Hz over a 225×

bandwidth range.

APPLICATIONS AND ECONOMIC IMPACT

• A bandwidth/power-scalable sensor interface enables adaptive signal acquisition – the ADC conversion rate may be adjusted

based on e.g. the richness of the sensor input data and/or the amount of energy available in the system. Adaptive signal

acquisition may enable more feature-rich IoT and wireless sensor nodes.

• More versatile sensor interfaces may reduce development time and cost due to their straightforward reconfigurability.

Session 28 Overview: Capacitive Sensor Readout

Analog Subcommittee

Session Chair: Caspar van Vroonhoven, Analog Devices, Germany

Session Co-Chair: Chinwuba Ezekwe, Robert Bosch, CA

Time-domain signal processing, two-step conversion, and continuous-time signal chains enable improvements in the performance of

capacitance-to-digital converters for a diverse range of applications. The techniques revealed in this session include the combination of

time-domain signal processing and two-step conversion to improve input range and conversion time (Paper 28.1), the use of continuous-

time high-pass noise-shaping to mitigate thermal noise aliasing and flicker noise (Paper 28.2), the use of a novel phase-domain add-

then-subtract lead-compensation scheme to stabilize a controlled-oscillator-based backend 2nd-order delta-sigma modulator (Paper 28.3),

and the use of phase-domain signal processing to enhance immunity to EMI (Paper 28.4).

• In Paper 28.1, KAIST, ETH, and NYU present a period-modulation-SAR hybrid capacitance-to-digital converter. The converter combines a period-

modulation front-end with a SAR-ADC backend to achieve 14.5b ENOB, 6.1μs conversion time, and an 18.1pF-to-18.5nF scalable full-scale

range.

• In Paper 28.2, KAIST and NYU present a continuous-time noise-shaping SAR capacitance-to-digital converter. The converter combines a

continuous-time capacitance-to-voltage converter with a continuous-time highpass noise-shaping SAR ADC to mitigate thermal noise aliasing

and flicker noise, achieving a 189.3dB-FoMS with 14.5fJ/conversion-step FoMW at 5.12pF full-scale range.

• In Paper 28.3, Peking University presents an incremental zoom capacitance-to-digital converter. The converter combines a SAR front-end with a

2nd-order delta-sigma modulator backend, which employs a Gm-C-based 1st stage and a controlled-oscillator-based 2nd stage, stabilized by an

add-then-subtract lead-compensation scheme, to achieve an 18.2fJ/conversion step FoMW at 1.2pF full-scale range.

• In Paper 28.4, Tsinghua University and CoSensing reveal a capacitance-to-digital converter for EMI-sensitive floating-target displacement-sensing

applications. Using a capacitance-to-phase-converter front-end and a phase-domain delta-sigma modulator, which together provide inherent

band-pass filtering and common-mode rejection, the converter achieves 143dB DR, with CMRRs of 119dB and 97dB near DC and 100MHz,

respectively.

Session 28 Highlights: Capacitive Displacement Sensor

[28.4] A 143dB-Dynamic-Range 119dB-CMRR Capacitance-to-Digital Converter for

High-Resolution Floating-Target Displacement Sensing

Paper Authors: Sining Pan1, Xiaolong Zhang1, Baoyi Zheng1, Yihang Cheng1, Hui Jiang2, Huaqiang Wu1

Paper Affiliation: 1Tsinghua University, Beijing, China, 2CoSensing, Utrecht, The Netherlands

Subcommittee Chair: Viola Schaffer, Texas Instruments Deutschland GmbH, Freising, Germany, Analog

CONTEXT AND STATE OF THE ART

• Capacitive floating-target displacement sensors, which sense the capacitance between a probe and a target, could be a lower-

power alternative to eddy-current-based floating-target displacement sensors.

• Existing solutions are based on singe-ended readout, which is highly vulnerable to EMI, or differential-switched-capacitor

readout, which is still vulnerable to EMI, especially at high frequencies, in any practical realization with inevitably imperfectly

matched differential-sensing capacitors.

TECHNICAL HIGHLIGHTS

• A capacitance-to-digital converter for floating-target displacement-sensing applications combines differential

readout with phase-domain signal processing to improve robustness against EMI.

o A capacitance-to-phase-converter front-end and a phase-domain delta-sigma modulator together provide inherent

bandpass filtering and common-mode rejection, enabling the converter to reject EMI as demonstrated by CMRRs of

119dB and 97dB near DC and 100MHz, respectively.

o The converter achieves a dynamic range of 143dB while dissipating 625µW.

APPLICATIONS AND ECONOMIC IMPACT

• Displacement sensors are widely used in high-precision nano-mechatronic systems.

• An EMI-robust capacitive solution offers the possibility to integrate floating-target displacement-sensing functionality into EMI-

rich environments for lower cost and low power dissipation.

Session 29 Overview: SRAM

Memory Subcommittee

Session Chair: John Wuu, AMD, Fort Collins, CO

Session Co-Chair: Yih Wang, TSMC, Hsinchu, Taiwan

SRAM continues to play an indispensable role in the relentless pursuit for higher compute performance. As traditional transistor scaling

slows, innovations and design-technology co-optimization (DTCO) are required more than ever to further extend SRAM’s density, speed

and functionality for energy-efficient compute. This session highlights four SRAM papers and one TCAM paper that push density, speed,

power and operation boundaries by combining architectural and circuit innovations with SRAM bitcells in advanced-process technologies

that include FinFET, Nanosheet and RibbonFET CMOS with backside interconnects.

• In Paper 29.1, TSMC introduces a 38.1Mb/mm2 SRAM in 2nm-CMOS-nanosheet technology for high-density and energy-efficient compute

applications. The design uses a 0.021um2 high-density bitcell, and through DTCO improves the overall SRAM density by 1.1× compared to the

previous technology node.

• In Paper 29.2, Intel reveals their SRAM design in the 18A-RibbonFET-CMOS technology, with PowerVia for back-side power delivery, using a

0.023um2 high-current and a 0.021um2 high-density bitcell. The gate-all-around technology demonstrates a 68mV VMIN reduction for the

34.3Mb/mm2 high-density SRAM design over the prior FinFET technology and supports up to 38.1Mb/mm2 area density.

• In Paper 29.3, MediaTek presents a 3nm Fin-FET TCAM with dynamically-gated search lines for data center ASICs. The 2.2G-searches/s

0.305fJ/b design achieves a 37.7% power and a 46.6% peak-current reduction through its power-optimization features at 4.97Mb/mm2.

• In Paper 29.4, Synopsys demonstrates a dual-rail SRAM design in 3nm FinFET technology. By optimizing level shifters, the design achieves a

38Mb/mm2 area density, while supporting 0.38 - 1.4V logic and 0.54 - 1.4V array voltage ranges.

• In Paper 29.5, TSMC presents a 3nm 3.6GHz dual-port SRAM with backend-RC optimizations and a far-end write-assist scheme. The 0.052um2

dual-port-bitcell based design exhibits 3.6GHz operation at 1.0V and 495mV VMIN.

Session 29 Highlights: SRAM

[29.1] A 38.1Mb/mm2 SRAM in a 2nm-CMOS-Nanosheet Technology for High-Density

and Energy-Efficient Compute

[29.2] 0.021μm2 High-Density SRAM in Intel 18A RibbonFET Technology with PowerVia-

Backside Power Delivery

Paper 29.1 Authors: Tsung-Yung Jonathan Chang, Yen-Huei Chen, K. Venkateswara Reddy, Nikhil Puri, Teja Masina, Kuo-Cheng Lin,

Po-Sheng Wang, Yangsyu Lin, Chih-Yu Lin, Yi-Hsin Nien, Hidehiro Fujiwara, Ching Wei Wu, Robin Lee, Hung-Jen Liao, Quincy Li, Ping

Wei Wang, Geoffrey Yeap

Paper 29.1 Affiliation: TSMC, Hsinchu, Taiwan

Paper 29.2 Authors: Xiaofei Wang1, Yusung Kim1, Hasan Abdullah1, Gwang Hyeon Baek1, Kunal Girish Bannore1, Kaushal Dave1,

Arash Joushaghani1, Narae Kang1, Minwoo Ko1, Kae Yih Lim2, Anandkumar Mahadevan Pillai3, Hema Chandra Prakash Movva1,

Gyusung Park1, Muktadir Rahman1, Seenivasan Subramaniam1, Vinay Vashishtha1, Teng Yang1, Zheng Guo1, Eric A Karl1

Paper 29.2 Affiliation: 1Intel, Hillsboro, OR, 2Intel, Penang, Malaysia, 3Intel, Santa Clara, CA

Subcommittee Chair: Meng-Fan (Marvin) Chang, National Tsing Hua University (NTHU), Taiwan

CONTEXT AND STATE OF THE ART

● Compute density and power efficiency are critical knobs for meeting the ever-increasing demand for compute, especially in the

age of explosive machine-learning growth.

● 3nm FinFET is the current state-of-the-art technology; however, nanosheet and RibbonFET CMOS technologies, especially

when combined with back-side power delivery, hold the promise to further propel SRAM energy efficiency.

● In the face of decelerating Moore’s Law, design innovations must also accompany technology advancements to continue

SRAM area and efficiency scaling.

TECHNICAL HIGHLIGHTS

● TSMC presents a high-density SRAM in 2nm-nanosheet CMOS for high-density and energy-efficient compute

o The high-density SRAM macro, featuring a 0.021μm2 high-density (HD) SRAM bitcell in a 2nm nanosheet CMOS,

achieves a bit density of 38.1Mb/mm2: a ~10% improvement over the prior 3nm FinFET CMOS technology.

● Intel reveals their SRAM design in the 18A RibbonFET CMOS technology with PowerVia for back-side power delivery,

using 0.023um2 high-current and 0.021um2 high-density SRAM bitcells

o Gate-all-around technology demonstrates a 68mV VMIN reduction for the 34.3Mb/mm2 high-density SRAM design,

compared to the prior FinFET technology, and support for up to a 38.1Mb/mm2 area density.

APPLICATIONS AND ECONOMIC IMPACT

● First high-volume demonstration of SRAMs from leading technology providers on a 2nm-nanosheet and 18A-RibbonFET

technology with backside power delivery.

● Technology optimization and design-technology co-optimization (DTCO) enable SRAM-density and energy-efficiency

enhancements over prior technologies, benefiting products across the industry to advance compute and broaden applications

to enrich human experiences.

Session 30 Overview: Nonvolatile Memory and DRAM

Memory Subcommittee

Session Chair: Seung-Jae Lee, Samsung, Korea

Session Co-Chair: Thomas Hein, Micron, Germany

Cutting-edge technologies continue to emerge, leading to increased memory capacities and faster performance. Samsung demonstrates

a NAND Flash memory with 4XX WL layers and 5.6Gb/s IO rate. Kioxia introduces a NAND Flash memory with the highest bit density.

Samsung introduces the 1st 24Gb GDDR7 DRAM, with a 42.5Gb/s IO rate and power-efficiency techniques. Samsung presents a 16G

LPDDR5X operating at 12.7Gb/s, with a self-calibrating IO scheme. SK Hynix introduces the highest-capacity (2Tb) QLC NAND Flash

memory, with a six-plane architecture. Kioxia and SK Hynix present the world’s highest density 64Gb cross-point MRAM chip.

• In Paper 30.1, Samsung presents a 4XX-layer 1Tb 3b/cell 3D NAND Flash with a wafer-bonding architecture, achieving a 28.2Gb/mm² bit density.

Eliminating dummy holes and using a 2-transistor coded-GSL scheme enables a 4% array size and 8% power consumption reduction. This design

also offers a fast 5.6Gb/s/pin IO speed and a low-power option with PI-LTT, cutting channel power by 73% compared to LTT.

• In Paper 30.2, Kioxia and Western Digital introduce a 1Tb 3b/cell 3D Flash memory, achieving a 29% read energy-efficiency improvement and

features a 4.8Gb/s I/O interface using PI-LTT. The I/O circuit area is reduced by 17% by circuit optimization and CMOS directly-bonded-to-array

(CBA) technology. This design also achieves a bit density over 29Gb/mm² with 332-WL layers.

• In Paper 30.3, Samsung shows the 1st implementation of a 24Gb GDDR7 DRAM and 42.5Gb/s data rate, with power reduction techniques for

WCK clocking and core operation; thereby, achieving a 13.6% energy-efficiency improvement over prior GDDR7.

• In Paper 30.4, Samsung presents a 16Gb 12.7Gb/s LPDDR5X DRAM with a self-calibrating IO scheme and AC-coupled equalization in a 5th-

generation 10nm technology. The distributed 4-phase clock skew is reduced by 33.7% via self-calibration.

• In Paper 30.5, SK Hynix presents a 2Tb 4b/cell 3D-NAND Flash memory with a 3.2Gb/s IO speed and a 6-plane architecture. Increased internal-

bias levels, during programming, reduce program disturbance by 18%. A precise bias generator ensures consistent bias levels across

temperatures, while efficient power gating in the TX interface decreases CIO by 20% and area by 55%.

• In Paper 30.6, Kioxia and SK Hynix introduce the world’s highest density 64Gb cross-point MRAM chip, featuring a 0.001681µm² 20.5nm HP 1-

selector 1-MTJ cell. The work presents a timing-controlled-discharge reading scheme, combined with a local-capacitance mode which enables a

high-speed 3ns read pulse and a read margin of over 4σ, while addressing read disturbances using the smallest cell size to date.

Session 30 Highlights: Nonvolatile Memory and DRAM

[30.1] A 28Gb/mm2 4XX-Layer 1Tb 3b/cell WF-Bonding 3D-NAND Flash with 5.6Gb/s/pin

IOs

[30.3] A 24Gb 42.5Gb/s GDDR7 DRAM with Low-Power-WCK Distribution, an RC-

Optimized Dual-Emphasis TX, and Voltage/Time-Margin-Enhanced Power Reduction

Paper 30.1 Authors: Sang-Soo Park, Jae-Doeg Lyu, Myungjun Kim, Jaeyun Lee, Younsun Song, Chung-Ho Yu, Hirano Makoto,

Yongseok Kwon, Jong-Hoon Park, Ho-Joon Kim, Daein Lee, Donghyun Seo, Byungrok Go, Seoyoon Jeon, Yoonjee Kim, Doo-Hyun

Kim, Youngmin Jo, Hyunjun Yoon, Junehong Park, Inmo Kim, Sunghoon Kim, Hokil Lee, Je-Hyeon Yu, Sang-Lok Kim, Hwan-Seok Ku,

Jungmin Seo, Jindo Byun, Seung-Hyeon Yun, Kyoungtae Kang, Seung-Beom Kim, Yohan Lee, Yongkyu Lee, Kyunghwa Kang, Han-

Jun Lee, Younghwan Ryu, Hyundo Kim, Wontae Kim, Hyeongdo Choi, Juho Jeon, Ansoo Park, Raehyun Song, Jae-Hwan Kim, Jung-

Soo Kim, Hwa-Seok Lee, Moo-Kyung Lee, Jae-Ick Son, Jiho Cho, Moosung Kim, Jae-Woo Im, Jongmin Park, Hyuckjoon Kwon,

Youngdon Choi, Chiweon Yoon, Seungjae Lee, Kiwhan Song, Sung-Hoi Hur

Paper 30.1 Affiliation: Samsung Electronics, Hwaseong-si, Korea

Paper 30.3 Authors: MSang-Hoon Kim, Jaehyeok Baek, Moon-Chul Choi, Daewoong Lee, Donggun An, Se mi Kim, Yeonggeun

Song, Minkyo Shim, Sung-Yong Cho, Dongha Lee, Gunhee Cho, In-Woo Jun, Juseop Park, TaeYoon Lee, Hwan-Chul Jung,

Chanyong Lee, Gil-Young Kang, Hye-Ran Kim, Jongmyung Lee, Young Su Joo, Hyo-Jin Jung, Bokyeon Won, Ji-Hak Yu, Sangkeun

Han, Yechan Hwang, Chungman Kim, Seok-Jung Kim, YoungSeok Lee, Young-Tae Kim, Myeong-O Kim, Wonhwa Shin, Tae-Young

Oh, SangJoon Hwang

Paper 30.3 Affiliation: Samsung Electronics, Hwaseong, Korea

Subcommittee Chair: Meng-Fang Chang, Tsing Hua University, Taiwan, Memory

CONTEXT AND STATE OF THE ART

● Paper 30.1 describes NAND Flash with 4XX WL layers and 5.6Gb/s/pin data rate, it emphasizes power reduction and

improved IO speed, thereby addressing current challenges in NAND Flash memory.

● High-speed graphics GDDR7 with PAM3 interfaces is on the forefront of interface speed and DRAM architecture. The focus is

now shifted towards power efficiency, while still increasing the speed.

TECHNICAL HIGHLIGHTS

● Samsung introduces a 1Tb 3b/cell 3D-NAND Flash with 4XX WL layers and 5.6Gb/s/pin IO speed.

o By removing dummy holes and adopting a 2-transistor coded-GSL scheme, they reduce array size by 4% and power

consumption by 8%.

o This work achieves the fastest (5.6Gb/s/pin) IO speed, along with a low-power using PI-LTT that reduces channel

power by 73% than LTT.

● Samsung Electronics presents the 1st implementation of a 24Gb GDDR7 DRAM achieving a data rate of 42.5Gb/s with

power-reduction techniques for core operation.

o A two-row DQ architecture, with low-power WCK-distribution techniques, reduces clock-distribution power. The

sense-margin enhancement and power-reduction techniques achieve a 13.6% energy efficiency improvement.

APPLICATIONS AND ECONOMIC IMPACT

● It is anticipated that enhanced performance will be achieved in mobile and PCIe applications, while power consumption is

reduced.

● High-performance graphics DRAMs are now used everywhere where high bandwidth is required: gaming, AI training and

inference, and advanced driver-assist systems (ADAS) for automotive applications.

● Architectural and circuit changes are required to achieve the highest bandwidth, while still achieving the lowest possible power

consumption. Novel high-speed interface techniques are applied to achieve reliable PAM3 data transmission.

Session 31 Overview: Energy Harvesting and IoT Power

Power Management Subcommittee

Session Chair: Gaël Pillonnet, CEA-Leti, Grenoble, France

Session Co-Chair: Sung-Wan Hong, Sogang University, Seoul, Korea

Recent advancements in energy harvesting and IoT power management are featured in four papers at this year's ISSCC. Paper 31.1

presents a hybrid system that combines piezoelectric and electromagnetic harvesters to achieve improved efficiency and circuit

compactness. Paper 31.2 introduces a biased-SECE interface for fast maximum power point tracking in piezoelectric harvesting. Paper

31.3 highlights a sense-and-track MPPT circuit designed for continuous power extraction under various conditions, including periodic and

shock excitations. The final paper, 31.4, showcases a PMIC optimized for mm-scale IoT applications, enhancing burst energy delivery

through a hybrid combination of capacitors and batteries.

• In Paper 31.1, TU Delft presents a fully integrated hybrid energy harvesting system using a coil-sharing structure, combining piezoelectric and

electromagnetic harvesters. It achieves 2.72mW power with 90% efficiency, eliminating the need for a bulky off-chip electrical interface.

• In Paper 31.2, KAIST presents a biased-SECE interface with geometric-mean computational MPPT for a piezoelectric harvester. It tracks MPP

quickly and accurately without interrupting the harvesting, achieving 99.9% efficiency. Fabricated in 180nm BCD, the system improves FoMs by

9.29× for periodic excitation and 5.02× for shock inputs.

• In Paper 31.3, TU Delft University presents a sense-and-track MPPT circuit for piezoelectric energy harvesting, achieving fast, continuous MPPT

and improving power extraction by 718% under periodic and 568% under shock excitations. Fabricated in 0.18μm BCD, the circuit operates

efficiently with a 0.67mm² active area.

• In Paper 31.4, University of Macau presents a capacitive PMIC with 91.25% peak efficiency for mm-scale IoT applications, featuring a

reconfigurable HV converter. It achieves a ~70% energy extraction efficiency (EEE) and delivers up to 5.68mJ using a 220μF capacitor, improving

EEE by over 35× and burst energy by 27×, while reducing storage capacitance by 34× compared to prior designs.

Session 31 Highlights: Energy Harvesting and IoT Power

[31.1] An Inductor-less Capacitor-less Synchronous Piezoelectric-Electromagnetic

Hybrid Energy Harvesting Platform with Coil-Sharing Scheme

Paper Authors: Y. Wei, X. Yue, Z. Chen, S. Du

Paper Affiliation: Delft University of Technology, Delft, The Netherlands

Subcommittee Chair: Bernhard Wicht

CONTEXT AND STATE OF THE ART

• The growing adoption of IoT and edge AI applications is increasing the demand for sustainable and compact energy harvesting

solutions to power devices, aiming to replace batteries or extend their lifespan.

• Current piezoelectric-based energy harvesting interfaces often rely on bulky off-chip inductors or multiple capacitors, which adds

to the system's volume and bill of materials (BoM), emphasizing the need for innovative solutions to enhance integration.

• Combining different types of harvesters can offer more reliable and efficient energy production; however, this approach faces

challenges with interface complexity, which can further increase the size and impact the overall performance.

TECHNICAL HIGHLIGHTS

• A novel hybrid energy harvesting platform integrates piezoelectric and electromagnetic sources, utilizing a coil-sharing

structure and synchronized energy phases to eliminate the need for passive components in the electrical interface and

maximize harvested power.

o The energy harvesting system, which extracts energy from both electromagnetic and piezoelectric sources, achieves

a maximum output power of 2.72mW and a peak efficiency of 90% without the use of additional passive components,

significantly boosting energy extraction capabilities for IoT devices.

APPLICATIONS AND ECONOMIC IMPACT

• This energy harvesting solution is ideal for powering small-scale IoT devices in vibration-rich environments, enabling self-

sustained operation or extending battery life with a minimal electrical interface. Utilizing dual sources enhances the robustness

and power density of energy harvesting by employing two mechanisms.

• By eliminating the need for passive components used in traditional electrical interfaces, this platform lowers production costs

and simplifies system-level manufacturing, making it a compelling choice in the expanding energy harvesting market.

Session 32 Overview: Isolated Power and Gate Drivers

Power Management Subcommittee

Session Chair: Lin Cheng, University of Science and Technology of China, China

Session Co-Chair: Shusuke Kawai, Toshiba Corporation, Japan

Isolated Power and Gate Drivers are critical for ensuring safety and reliability in harsh industrial environments, such as electric vehicles

and industrial automation. This session aims to showcase some of the best work in the isolated power and gate drivers area, featuring

novel topologies that offer high conversion efficiency, reduced electromagnetic interference (EMI), and advanced power regulation for

isolated converters, as well as precise gate control techniques for GaN gate drivers.

• In Paper 32.1, The University of Macau and Instituto Superior Tecnico/University of Lisboa present an isolated DC-DC converter with a Q-downsize

Class-D power amplifier with inherent shoot-through current blocking. The design achieves 45.3% peak efficiency and 1W maximum output power

at 180MHz, while meeting CISPR-32 Class-B certification.

• In Paper 32.2, Iowa State University introduces a single-link multi-domain-output (SLiMDO) isolated DC-DC converter, providing two regulated

outputs and achieving global power modulation via a single FPC micro-transformer with passive magnetic flux sharing and Rx behavior sensing.

The design achieves 62.6% peak efficiency at 610mW and 1.13W maximum output power.

• In Paper 32.3, National Yang Ming Chiao Tung University, Chip-GaN Power Semiconductor, Realtek Semiconductor and Taiwan Semiconductor

Research Institute present a GaN-based CGS isolated driver featuring an accurate secondary-side controller for precise dead-time control. It

achieves less than 3ns delay to turn on/off GaN-based rectifiers.

• In Paper 32.4, University of Science and Technology of China presents a dual-LC-resonant isolated DC-DC converter incorporating inherent

backscattering, which enhances efficiency and eliminates the need for a digital isolator. The design achieves 65.4% peak efficiency and 1.5W

maximum output power.

• In Paper 32.5, University of Science and Technology of China and Hefei CLT Microelectronics present a multi-core isolated DC-DC converter

with an embedded magnetic-core transformer to reduce EMI and output voltage ripple. The design achieves 53.2% peak efficiency and less than

5mV ripple while meeting CISPR-32 Class-B EMI compliance.

• In Paper 32.6, National Yang Ming Chiao Tung University, Chip-GaN Power Semiconductor, Realtek Semiconductor, and Taiwan Semiconductor

Research Institute present a GaN load switch for bidirectional current transfer and reverse current blocking. It achieves 91.4% inrush current

reduction and 89.5% overshoot reduction.

Session 32 Highlights: Isolated Power and Gate Drivers

[32.1] A 180MHz 45.3% Peak Efficiency Isolated Converter Using Q-Downsize Class-D

Power Amplifier with Inherent Shoot-Through

Paper 32.1 Authors: Tian Xia*1, Qiujin Chen*1, Shujing Wang1, Rui Paulo Martins1,2, Mo Huang1

Paper 32.1 Affiliation: 1University of Macau, Taipa, Macau, 2Instituto Superior Tecnico/University of Lisboa, Lisbon, Portugal of

Science and Technology of China, Hefei, China

Subcommittee Chair: Bernhard Wicht, University of Hannover, Germany, Power Management

CONTEXT AND STATE OF THE ART

● Isolated DC-DC converters with low electromagnetic interference (EMI) are essential for high reliability systems in harsh

industrial environments. Class-D PA and Class-D oscillation circuits are used to reduce noise and improve efficiency.

TECHNICAL HIGHLIGHTS

● The University of Macau and Instituto Superior Tecnico/University of Lisboa propose a Q-downsize Class-D PA type

isolated converter that combines the advantages of Class-D PA and Class-D oscillator circuits. The proposed circuit

simultaneously reduces the through current and the MOS drive loss at the TX, improving efficiency.

o The converter using the Q-downsize technique achieves a 45.3% peak efficiency and maximum 1W output power, at

a 180MHz operating frequency. The converter with symmetrical topology meets the CISPR-32 Class-B certification

requirements.

APPLICATIONS AND ECONOMIC IMPACT

● The proposed converter achieving highly efficient, low noise, and high frequency operation contributes to reducing system

volume, lowering power consumption, and improving reliability for systems in harsh industrial environments.

Session 33 Overview: Components for Beyond 100GHz

RF Subcommittee

Session Chair: Jeremy Dunworth, Qualcomm Technologies Inc., CA

Session Co-Chair: Hiroshi Hamada, NTT, Atsugi, Japan

Session Co-Chair: Mona M. Hella*, Rensselaer Polytechnic Institute, NY

Systems operating in the frequency range above 100GHz continue to face challenges associated with output power, tuning range,

power consumption, and integration levels. This session explores circuit techniques, advanced transistor technology options, and

package integration and co-design that address such challenges. The session includes an amplifier-multiplier chain with high output

power above 200GHz, a highly integrated GaN PA MMIC and module, two D-band phase shifters, and a G-band VCO. These

papers enable applications in high resolution radars, sensing, spectroscopy, ultra-high speed communications, and others.

• In Paper 33.1, the Massachusetts Institute of Technology describes a 232-to-260GHz amplifier-multiplier-chain (AMC) array in the Intel16

CMOS process radiating 11.1dBm. This is achieved due to a high-power RF FinFET transistor technology and a patterned dielectric matching

sheet to replace the conventional silicon lens.

• In Paper 33.2, the National Key Laboratory of Solid-State Microwave Devices and Circuits, Nanjing Electronic Device Institute, and Tianjin

University demonstrate a solid-state power amplifier with a maximum output power of 1.54W at 223GHz achieved by combining 32 GaN

MMICs co-designed with the package. The GaN MMIC PAs are enhanced by carefully designed AlN/GaN heterojunction

(fT/fMAX=180GHz/420GHz), and the design procedure utilizes a multiband large-signal impedance-correction technique.

• In Paper 33.3, the University of Pavia showcases a hybrid active/passive phase shifter in 55nm SiGe BiCMOS. The phase shifter achieves

360-degree range across 125-to-170GHz band using an outphasing gain and phase calibration. The output 1dB compression point is above

2dBm with 31mW DC power consumption, corresponding to a 5× power efficiency enhancement against previous works.

• In Paper 33.4, ETH-Zurich presents a bidirectional phase shifter featuring 11.25° resolution and achieving a calibration-free fractional

bandwidth of 24% with <2.38°/0.63dB rms phase and gain errors across the 110-to-140GHz band in 22nm SOI.

• In Paper 33.5, Rensselaer Polytechnic Institute demonstrates a nearly 20% tuning range G-Band VCO (202.1 to 246.8GHz) with a varactor-

less multisection switch-loaded coupled-line resonator responsible for the extended tuning range and improved quality factor (>10) across

the band in 22nm FDSOI.

*1971 to 2025

Session 33 Highlights: Components for Beyond 100GHz

[33.2] A 216-to-226GHz Watt-Level GaN Solid-State Power Amplifier with Multiband

Large-Signal Impedance Correction and Circuit-Package Co-Design Technique

Paper Authors: Weibo Wang1,2, Zhe Li3, Haifeng Cheng1,2, Fangjin Guo1,2, Yibin Zhang1,2, Kai Li3, Keping Wang3

Paper Affiliation: 1National Key Laboratory of Solid-State Microwave Devices and Circuits, Nanjing, China, 2Nanjing Electronic Device

Institute, Nanjing, China, 3Tianjin University, Tianjin, China

Subcommittee Chair: Brian Ginsburg, Texas Instruments, Dallas, TX,, RF subcommittee

CONTEXT AND STATE OF THE ART

● 220GHz solid-state power amplifiers are typically built in InP, SiGe, or CMOS technologies. Amplifiers in CMOS/SiGe

technologies have limited output power due to low operating voltage and low (fT/fmax) while InP suffers from high cost and low

yield.

● Although GaN HEMTs possess wide bandgap, high breakdown voltage, and electron saturation drift velocity, no G-band

(220GHz) PAs in GaN HEMT technology have been reported to date.

TECHNICAL HIGHLIGHTS

● A 216-to-226GHz GaN solid-state power amplifier achieves 1.53W peak output power at 223GHz by using waveguides

to combine the outputs of 4 GaN MMIC modules. Each module consists of 8 GaN MMIC PAs with max Psat of 20.4dBm

per PA.

o GaN transistors fT and fmax are improved through several modifications to the GaN epitaxy, which increase

transconductance, reduce contact resistance, gate-source and gate-drain capacitances, and enhance the 2DEG

concentration via an AlN/GaN strongly polarized heterojunction.

o A large-signal impedance-correction technique is employed to measure output-power (Pout) peaks across W and D-

bands with broadband output matching networks. The large-signal model is corrected to improve Pout across all

frequencies.

o The GaN PA MMIC, waveguide cavity, bondwire, and probe-microstrip-transition structures are co-designed to avoid

impedance variations at assembly interfaces.

APPLICATIONS AND ECONOMIC IMPACT

● Increased G-band power-amplifier Psat enables future high-data-rate wireless communication, imaging, and radar systems.

● The availability of over 1W Psat at frequencies above 200GHz creates opportunities for instrumentation and sub-THz systems

that would not otherwise be possible.

Session 34 Overview:

Digital PLLs and Waveform-Shaping VCOs

RF Subcommittee

Session Chair: Xiang Gao, Zhejiang University, Hangzhou, China

Session Co-Chair: Yu-Li Hsueh, MediaTek, Taiwan

Digital PLLs and waveform-shaping VCOs continue to evolve and diversify. The first paper in this session describes a low-jitter, quantization-error-

compensating, BBPD-based fractional-N DPLL. The second paper introduces a DPLL-based frequency modulator for BLE. In the third paper, a DPLL

design is presented featuring a harmonic-shaping DCO with adaptive tuning of its common-mode resonance. An octave-tuning-range VCO with

magnetic mode switching and wide-band common-mode resonator is introduced in the 4th paper. This is followed by a quad-core class-F23 VCO,

designed without 2nd/3rd harmonic tuning. The final paper showcases a mm-wave VCO design incorporating a two-port resonator-based quasi-Class-

E Colpitts oscillation.

• In Paper 34.1, Seoul National University presents a 10.1GHz low-jitter fractional-N DPLL with a quantization-error-compensating BBPD, which

reduces the in-band noise and rms jitter. It also utilizes orthogonal-polynomial LMS multi-variable calibration to accelerate its convergence speed.

The DPLL achieves 65fs rms jitter and -252.2dB FoM.

• In Paper 34.2, Politecnico di Milano introduces a 2.25-to-2.5GHz digital-PLL-based frequency modulator, supporting 1Mb/s GFSK and 80MHz

frequency hopping for BLE. It exploits a fast-hopping technique with gain calibration of the DCO tuning curve and a gear-shift mechanism for re-

locking. The PLL consumes 380µW with sub-20µs settling time for the GFSK modulator.

• In Paper 34.3, Politecnico di Milano presents an adaptive technique to automatically tune the common-mode resonance of a harmonic-shaping

DCO in a DPLL. The DCO phase noise is automatically minimized in the flicker- and white-noise regions, and the PLL achieves 45.86fs rms jitter

and -257dB FoM.

• In Paper 34.4, Rice University presents an octave-tuning-range VCO with magnetic mode switching and a tuning-free, octave-BW common-mode

resonator for 1/f3 PN corner reduction. The 65nm-CMOS LO-generator design demonstrates a wide tuning range from 9.05 to 37GHz,

190.7dBc/Hz worst-case FoM, and 212.3dBc/Hz worst-case FoMT.

• In Paper 34.5, the Nanjing University of Science and Technology introduces a quad-core Class-F23 VCO with a circular trifilar transformer to

extend the differential-mode resonance around the 3rd harmonic and an 8-shape stacked inductor to extend the common-mode resonant peak

around the 2nd harmonic. The 65nm-CMOS 18.5-to-23.6GHz VCO achieves -139.3 to -138.0dBc/Hz phase noise, 193.0 to 190.1dBc/Hz FoM at

a 10MHz offset, and 140 to 250kHz 1/f3 corner frequency.

• In Paper 34.6, the Chinese University of Hong Kong proposes to improve the VCO PN by exploiting quasi-Class-E Colpitts oscillation based on a

two-port resonator. It enables fundamental oscillation, waveform shaping for power efficiency enhancement, and PN suppression. A 40nm-CMOS

prototype exhibits an FTR of 47.3 to 58.4GHz (21.1%), PN from –124.9 to –119.8dBc/Hz at a 10MHz offset, 5.5 to 10.5mW power consumption,

and 198.8 to 192.6dBc/Hz FoMT.

Session 34 Highlights:

Digital PLLs and Waveform-Shaping VCOs

[34.2] A 380μW and -242.8dB FoM Digital-PLL-Based GFSK Modulator with sub-20μs

Settling Frequency Hopping for Bluetooth Low-Energy in 22nm CMOS

Paper Authors: Simone Mattia Dartizio, Giacomo Castoro, Stefano Gallucci, Michele Rossoni, Riccardo Moleri,

Francesco Tesolin, Pietro Salvi, Saleh Karman, Andrea Leonardo Lacaita, Salvatore Levantino

Paper Affiliation: Politecnico di Milano, Milano, Italy

Subcommittee Chair: Brian Ginsburg, Texas Instruments, Dallas, TX, RF Subcommittee

CONTEXT AND STATE OF THE ART

● A very low-power PLL-based frequency modulator supports 1Mb/s GFSK and 80MHz frequency hopping for BLE. It exploits a

fast-hopping technique with gain calibration of the DCO tuning curve and a gear-shift mechanism for re-locking.

TECHNICAL HIGHLIGHTS

● Politecnico di Milano presents a 22nm digital-PLL achieving 1Mb/s GFSK frequency modulation and fast frequency

hopping with very low power consumption

o A 2.25-to-2.5GHz frequency modulator supporting 1Mb/s GFSK and 80MHz frequency hopping for BLE. It exploits a

fast-hopping technique with dither-based gain calibration of the DCO tuning curve and a gear-shift mechanism for re-

locking. The PLL consumes 380µW with sub-20µs settling for the GFSK modulator.

APPLICATIONS AND ECONOMIC IMPACT

● High power-efficiency techniques for PLL-based frequency modulators.

● Suitable for ultra-low-power applications, such as BLE.

Session 35 Overview:

Implantable and Wearable Biomedical Devices

Imagers, Medical, and Displays Subcommittee

Session Chair: Mehdi Kiani, Pennsylvania State University, University Park, PA, USA

Session Co-Chair: Bo Zhao, Zhejiang University, Hangzhou, China

Biomedical devices for implantable and wearable applications integrate various functionalities, including wireless power and data transfer,

recording and stimulation, security, and imaging. The papers in this session continue to explore different modalities for wireless power

and data transfer (e.g., inductive, ultrasound, magnetoelectric), neural stimulation (e.g., electrical, ultrasound), and biomedical imaging

(e.g., photoacoustic, bioimpedance spectroscopy). The first half of the session focuses on ASICs designed for security, ultrasound

stimulation and imaging, and bioimpedance spectroscopy, while the second half highlights ASICs with wireless power/data capabilities.

• In Paper 35.1, Delft University of Technology presents an efficient high-voltage transmit beamformer ASIC for wearable ultrasound stimulation. It

drives a 32-element array with 36V phase-delayed pulses using a single inductor that resonates with the capacitance of the transducer elements,

reducing the energy loss by 88.2%.

• In Paper 35.2, Rice University introduces a low-power photoacoustic receiver for wearable imagers that employs a multichannel analog spatial-

domain compressive sensing. The ASIC provides 4× reduction in output data rate and number of ADCs.

• In Paper 35.3, University College London presents a 30MHz wideband bioimpedance spectroscopy IC with time-to-digital demodulation. The IC

achieves an equivalent sampling rate of 21.6GHz with a 240MHz clock utilizing a co-prime delay-locked sampling method. At 30MHz, the IC has

an overall 92.7dB SNR, 99.6% accuracy with 0.39% magnitude error and 0.57° phase error, and 3.18mW power consumption.

• In Paper 35.4, Rice University introduces a mechanical-input-based two-factor authentication (2FA) protocol in a miniature implant powered

wirelessly by a magnetoelectric transducer. It achieves a 2FA success rate of 98.27% and demonstrates continuous uplink with on-off keying

modulation at up to 110kb/s.

• In Paper 35.5, Korea Advanced Institute of Science & Technology shows a wireless adiabatic neural stimulation system employing current-mode

power reception through inductive coupling. The ASIC achieves precise charge delivery, a power conversion efficiency of 72.5%, and a stimulation

efficiency factor of 6.02.

• In Paper 35.6, Korea Advanced Institute of Science & Technology presents an enhanced-frequency-splitting-based wireless power and data

transfer system for small implants employing link-load isolation. The wireless system achieves an end-to-end power efficiency of 60.2% and a

downlink data rate of 1Mb/s using a sub-cm receiver coil.

• In Paper 35.7, Pohang University of Science and Technology presents an automated resonant current-mode ASIC for wireless power transfer

with a three-dimensional receiver coil. The power-management ASIC achieves up to 75.1% power conversion efficiency, while the three-

dimensional receiver coil provides robustness against misalignments.

• In Paper 35.8, University of California at Berkeley describes DustNet, an ultrasound-based transceiver capable of supporting up to 8 implants

through time-division multiplexing with 16-level amplitude-shift-keying backscatter modulation. The wireless system operates at 90mm depth with

a maximum data rate of 400kb/s at 2MHz carrier frequency (50kb/s uplink per implant).

Session 35 Highlights: Photoacoustic/Implant RXs

[35.2] A Spatial-Domain Compressive-Sensing Photoacoustic Imager with Matrix-

Multiplying SAR ADC

[35.6] An Enhanced-Frequency-Splitting-Based Wireless Power and Data Transfer

System Achieving 60.2% End-to-End Efficiency and 1Mb/s Data Rate with a Sub-cm RX

Coil for Miniaturized Implants

Paper 35.2 Authors: Huan-Cheng Liao, Shunyao Zhang, Yumin Su, Arvind Govinday, Yiwei Zou, Wei Wang, Vivek Boominathan,

Ashok Veeraraghavan, Lei Li, Kaiyuan Yang

Paper 35.2 Affiliation: Rice University, Houston, TX

Paper 35.6 Authors: Yechan Park, Chul Kim, Minkyu Je

Paper 35.6 Affiliation: KAIST, Daejeon, Korea

Subcommittee Chair: Rikky Muller, University of California, Berkeley, CA

CONTEXT AND STATE OF THE ART

● A compact and low-power photoacoustic RX employs a multichannel analog spatial-domain compressive sensing to achieve a

significant reduction in output data rate and number of ADC.

● A matrix-vector multiplying SAR ADC supports a general sensing matrix with a ternary setting, while two reconstructed methods

based on fast iterative shrinkage-thresholding algorithm and implicit neural representations reconstruct the compressed signal.

● An enhanced-frequency-splitting-based wireless power and data transfer system for miniaturized implants achieves a high end-

to-end efficiency and a high data rate using a sub-cm RX coil.

● The system minimizes coil size by employing a dynamically controlled LC tank and link-load isolation, enhancing the

performance in compact wireless implants.

TECHNICAL HIGHLIGHTS

● Rice University introduces a spatial-domain compressive sensing photoacoustic RX with matrix-multiplying SAR

ADC

o A photoacoustic RX chip incorporating spatial domain compressive sensing achieves a compression of the output

data rate by a factor of 4 to 8.

● KAIST presents an enhanced-frequency-splitting-based wireless power and data transfer system for miniaturized

implants.

o The enhanced-frequency-splitting-based wireless power and data transfer system achieves 60.2% end-to-end

efficiency and 1Mb/s data rate using a sub-cm RX coil.

APPLICATIONS AND ECONOMIC IMPACT

● Compressive sensing RX hardware supports a general sensing matrix while using fewer ADCs than input channels, potentially

enabling the envisioned wearable photoacoustic imager.

● The technique minimizes coil size and mitigates trade-offs in wireless power and data transfer systems, enabling enhanced

performance in compact wireless implants.

Session 36 Overview: Ultra-High-Density D2D

and High-Performance Optical Transceivers

Wireline Subcommittee

Session Chair: Masum Hossain, Carleton University, Ottawa, Canada

Session Co-Chair: Tamer Ali, MediaTek, Irvine, USA,

Subcommittee Chair: Thomas Toifl, Cisco Systems, Thalwil, Switzerland

ML-driven computing demands higher bandwidth density and improved energy efficiency. High-density die-to-die interconnects with ultra-high-speed

silicon photonics solutions can provide low latency and wide-bandwidth connectivity with excellent energy efficiency on a chiplet platform. The first three

papers in the session will present high-density electrical interconnect solutions whereas the last five papers will focus on different challenges of optical

transceivers. The first paper from TSMC reports the highest beachfront density in 3nm finFET technology. The second paper in the session presents

innovative crosstalk and echo cancellation techniques for single-ended signaling. The third paper from Cadence presents UCIe-compliant high-density

interconnect over a 2.5D interposer interface in 3nm FinFET. In the fourth paper, Intel will present the highest reported speed VCSEL driver and

transimpedance amplifier front-end in CMOS technology. In the fifth paper, Xi'an JiaoTong University presents a low-latency solution for 200Gb/s

pluggable optics. The sixth paper in the session introduces several circuit techniques to achieve a 112Gb/s PAM-4 linear TIA with 0.61pJ/b energy

efficiency. The seventh paper from the University of Illinois at Urbana-Champaign will present a 64Gb/s, 16-QAM analog coherent receiver designed

for high-speed short-reach optical links. The 8th paper from the Southern University of Science and Technology will present a 160Gb/s PAM-4 driver

with a swing of 3.8Vppd in 130nm SiGe BiCMOS. In the last paper of the session, Marvell will present a 212Gb/s PAM-4 retimer solution in 5nm FinFET

technology achieving a swing of 3Vppd with a BW of 54GHz and a TDECQ of 2.04dB when driving a Mach-Zehnder Modulator.

• In Paper 36.1 TSMC presents a UCIe-compliant die-to-die interface achieving 10.5Tb/s/mm beachfront density in 3nm technology. Over 64 lanes

this paper demonstrates an aggregate eye width of 19ps (61% UI) and an eye height of 560mV at 32Gb/s after lane deskew correction, with an

energy efficiency of 0.6pJ/b.

• In Paper 36.2, Peking University presents a 64Gb/s/wire single-ended simultaneous bi-directional transceiver using crosstalk and echo

cancellation techniques for die-to-die interfaces over 3mm shield-less on-chip channel achieving BER of 1e-16.

• In Paper 36.3, Cadence presents a 0.29pJ/b, 5.27Tb/s/mm UCIe interface over 2.5D interposer interface in 3nm FinFET. The PHY supports 4-

16Gb/s/pin achieving a BER of 1e-15 and latency of 3.5ns.

• In Paper 36.4, Intel presents a PAM-4 VCSEL-based direct-drive optical engine that integrates VCSEL driver and transimpedance amplifier front-

end CMOS ICs. Several circuit techniques are introduced to enable the >100Gb/s PAM-4 operation to achieve 0.9pJ/b energy efficiency.

• In Paper 36.5, Xi’an JiaoTong University presents a low-latency solution for 200Gb/s pluggable optics. The link includes a 200G transceiver with

a 200Gb/s analog MUX/DEMUX in 130nm SiGe and a 100Gb/s mixed-signal transceiver in 28nm CMOS achieving BER of 1e-12 transferring

PRBS7 at 200Gb/s.

• In Paper 36.6, Southern University of Science and Technology presents a 112Gb/s PAM-4 linear TIA. Employing several design techniques

including regulated active-input-termination TIA (AIT-TIA) and signal interpolation-based single-ended-to-differential converter (S2D) achieves

the longest 0.2inch PD-TIA reach with an excellent power efficiency of 0.61pJ/b.

• In Paper 36.7, the University of Illinois at Urbana-Champaign presents a low-power 16-QAM analog coherent receiver designed for high-speed

short-reach optical links. The prototype receiver recovers 64Gb/s data with a BER better than 1e-10 and achieves a power efficiency of 1.54pJ/b

in 28nm CMOS.

• In Paper 36.8, Southern University of Science and Technology presents a high swing linear driver. The driver topology utilizes cross-coupled Gm

cells and a cross-folded transmission line to achieve a maximum data rate of 160Gb/s PAM-4 with a swing of 3.8Vppd and a THD of 2.21% at 4Vppd

5GHz sinusoidal output in 130nm SiGe BiCMOS.

• In Paper 36.9, Marvell presents a 212Gb/s PAM-4 retimer solution demonstrating electrical long-reach capability and high swing optical driver in

5nm FinFET technology. The optical driver achieves a swing of 3Vppd with a BW of 54GHz and a TDECQ of 2.04dB when driving a Mach-Zehnder

Modulator. The C2M interface achieves a BER of 1.8e-10 on a 40dB loss channel with an energy efficiency of 2.76pJ/b.

Session 36 Highlights: Ultra-High-Density D2D

and High-Performance Optical Transceivers

[36.1] A 32Gb/s 10.5Tb/s/mm 0.6pJ/b UCIe-Compliant Low-Latency Interface in 3nm

Featuring Matched-Delay for Dynamic Clock Gating

Paper Authors: Mu-Shan Lin1, Chien-Chun Tsai1, Shenggao Li2, Wei-Chih Chen1, Wen-Hung Huang1, Yu-Chi Chen1, Yu-Jie Huang1,

Alan Drake3, Chin-Hua Wen1, Paul Ranucci3, Hsin-Hung Kuo1, Aidong Yin3, Shu-Chun Yang1, Farsheed Mahmoudi3, Han-Tzung Ke1,

Chao-Chieh Li1, Nai-Chen Cheng1, Jimmy Wang4, Kevin Lin1, Harry Liao4, Jie-Ren Huang1, Meng-Hsuan Wu4, Kenny Cheng-Hsiang

Hsieh1, Nicholas Amatruda5, William Polanco5, David King5, Todd Basso6, Anwar Kashem6

Paper Affiliation: 1TSMC, Hsinchu, Taiwan, 2TSMC, San Jose, CA, 3TSMC, Austin, TX, 4TSMC, Nanjing, China, 5AMD, Austin, TX,

6AMD, San Jose, CA

[36.9] A 212Gb/s PAM-4 Retimer with Integrated High-Swing Optical Driver and Chip-to-

Module Long Reach Capability of 40dB in 5nm FinFET

Paper Authors: V Gurumoorthy1, A Tan1, A Iyer1, A Fan2, A Farhoodfar1, B Alnabulsi3, C Abidin2, C Loi4, D Cartina5, H Lo1, I Fabiano6,

J Riani1, J H Teo4, J Q Wang1, K Raviprakash1, K K Ravi Prakash1, L Cai4, L Patra1, M Bachu1, N Codega6, N Shivashankar1, S Ray1, S

Chong4, S Jafarlou2, S Yu4, T F Wu2, W Y Neo4, X Ding4, Y Wang4, Z Yan1, Z Sun4, S Jantzi2, L Tse1, K Chang1

Paper Affiliation: 1Marvell, Santa Clara, CA, 2Marvell, Irvine, CA, 3Marvell, Ottawa, ON, Canada, 4Marvell, Singapore, Singapore,

5Marvell, Burnaby, BC, Canada, 6Marvell, Pavia, Italy

Subcommittee Chair: Thomas Toifl, Cisco Systems, Thalwil, Switzerland

CONTEXT AND STATE OF THE ART

• Accelerated by AI/ML-driven growth in computing performance, system disaggregation into chiplets using organic or advanced

packaging technologies with high bandwidth density has introduced challenges such as xtalk, low power and low latency

circuit design.

• In support of rapidly increasing demand for higher bandwidth and reach, ultra-high-speed optical interconnects have found

new applications such as data center switches to extend the reach of electrical interconnects.

TECHNICAL HIGHLIGHTS

• TSMC presents the industry first 32Gb/s/lane UCIe-compliant advanced (UCIe-A) packaging interface, via a 1.7mm

silicon bridge, achieving 10.5Tb/s/mm beachfront bandwidth density with an energy efficiency of 0.6pJ/b in 3nm

FinFET.

o The first demonstration of a 32Gb/s/lane UCIe-A compliant module, comprising 64 transmitter and 64 receiver lanes.

It achieves 30% higher beachfront bandwidth density than state-of-the-art die-to-die (D2D) work.

o The matched-delay architecture limits the data-to-clock relative phase drift to 0.3ps (< 1% UI), enabling dynamic

clock gating to tolerate instantaneous voltage drops and runtime recalibration for long-term temperature drift.

o Applying per-lane deskew correction improves the aggregate eye width from 26% to 61% UI.

o The D2D channel routing uses five metal layers in the bridge die where each signal is shielded by ground to minimize

crosstalk noise.

Marvell presents a 4-lane 212Gb/s PAM-4 complete re-timer solution with the first CMOS integrated high-swing optical

driver and chip-to-module (C2M) with capability of 40dB loss compensation, fabricated in 5nm FinFET.

o The integrated high-swing optical driver achieves a swing of 3Vppd with a bandwidth of 54GHz when driving a Mach-

Zehnder Modulator.

o Direct DAC-based transmitter architecture achieves 0.9Vppd swing with 60GHz BW.

o The design employs a back-to-back latch-based retimer to improve the timing window and single-edge jitter

attenuation technique.

o This design achieves 3 orders of magnitude better BER than prior art while communicating over a 2dB higher loss

channel.

APPLICATIONS AND ECONOMIC IMPACT

• UCIe interconnects enable system disaggregation into chiplets, communicating over 2/2.5/3D channels with high throughput at

low power consumption and latency. By enabling chiplet reuse and modularity, UCIe can accelerate time-to-market.

• 200+ Gb/s complete re-timer solution, including chip-to-module interfaces to drive long electrical interconnects in GPU-based

racks, can enable optical direct detect applications such as top-of-rack switches.

• Demonstrated circuit architectures with robust performance pave the way to mass deployment in commercial applications and

can lead to future standards such as co-packaged applications.

Session 37 Overview: Design-Technology Optimization

and Digital Accelerators

Digital Circuits Technique Subcommittee

Session Chair: Jae-sun Seo, Cornell Tech, New York, NY

Session Co-Chair: Mahmut Ersin Sinangil, NVIDIA, Santa Clara, CA

This session presents eight papers that push the boundaries of digital circuit techniques for design-technology optimization, domain-specific computing

and digital accelerators, aiming to enhance energy efficiency, system performance, and application-specific capabilities. The first paper explores

design-technology co-optimization (DTCO) for an industrial processor design, and the second paper presents a chiplet solution for networks-on-textiles

with system-on-chip and networking chiplets. The third paper demonstrates a compute-in-memory (CIM)-based microprocessor utilizing embedded

MRAM for neural network inference, and the fourth paper reports a reusable active TSV-interposer with programmability. The fifth paper introduces a

complete KSAT solver that achieves 100% solvability or proves unsatisfiability, and the sixth paper showcases a diffusion accelerator leveraging SRAM

CIM and eDRAM storage. The seventh paper highlights a GPS acquisition accelerator designed with energy-accuracy-driven optimization and

computing, and the eighth paper presents a low-power keyword spotting system featuring on-chip training for accented users.

• In Paper 37.1, IBM and University of Bonn present design technology co-optimization for the 8-core 5.5GHz IBM Tellum II processor in 5nm,

packing 1.38× more transistors in 1.13× area footprint and 1.05× power envelope, while simultaneously meeting a reliability target of 99.999999%

system uptime. Key innovations in standard cell library architecture, low-power sequential circuits, soft-error-tolerant design and design-for-

manufacturability are demonstrated.

• In Paper 37.2, researchers from the University of Virginia and Nautilus Defense LLC present two chiplets in 65nm, a system-on-chip and a

networking chiplet bySPI, that are designed for direct-die attachment to an embroidered conductive-yarn in-textile bus. The 0.6mmx2.15mm

chiplets enable networks-on-textiles that operate at 1.8V with the SoC consuming 3.34mW at 80MHz and the bySPI consuming 0.27mW at

34MHz.

• In Paper 37.3, authors from Samsung Advanced Institute of Technology, Seoul National University, Samsung Electronics and Harvard University

present an MRAM IMC-based processor for neural network inference in 28nm technology. Featuring 1.1Mb of embedded MRAM with system-

wise calibration techniques, this work achieves 20TOPS/W processor-level energy efficiency for 1b precision at 1.0V.

• In Paper 37.4, Fudan University and Kiwimoore Semiconductors present a 586mm2 reusable active TSV-interposer in 28nm CMOS

TSV/microbump technology integrating 512Mb SRAM and programmable 2.5D and 3D Network-on-active-interposer with up to 307.2GB/s BW,

integrated buck VRs, PLLs and a boot CPU.

• In Paper 37.5, Peking University presents a complete K-SAT solver in 28nm, achieving 100% solvability and unsatisfiable problem detection with

an SRAM-based dual-path macro, an incremental updater, and a position-encoded counter. This chip achieves average solving time of 218-

clause 50-variable SAT problems in 17.1ms or proves unsatisfiability in 42.1ms average time, at 0.9V and 200MHz, consuming 3.39mW.

• In Paper 37.6, researchers from Peking University describe a 864kb SRAM CIM-based accelerator in 22nm with 3Mb eDRAM storage for diffusion

models that perform image, 3D and video generation. The chip features bandwidth-aware memory partition and a bitline-segmented CIM cluster

design, achieving 60.81TFLOPS/W for INT8/BF16 hybrid precision at 180MHz and 0.6V.

• In Paper 37.7, researchers from Seoul National University and Columbia University present an end-to-end GPS acquisition accelerator using

mixed-radix IFFT optimizations and ROM-assisted computing in 28nm technology. Achieving 18.1mJ/acquisition at 88.5MHz and 0.6V, the

proposed accelerator supports acquisition of 32 satellites across all 41 doppler bins.

• In Paper 37.8, researchers from the Korea Advanced Institute of Science and Technology, ETH Zürich and Columbia University present a keyword

spotting system to recognize 35 keywords spoken by users with accents, featuring efficient feature extraction and on-chip training. The 28nm chip

achieves 92.2% accuracy on accented speech, while consuming 10.93mW for inference and 13.46mW for training at 0.75MHz under a 1V analog

and 0.7V digital supply.

Session 37 Highlights: Design-Technology Optimization

and Digital Accelerators

[37.6] A 22nm 60.81TFLOPS/W Diffusion Accelerator with Bandwidth-Aware Memory

Partition and BL-Segmented Compute-in-Memory for Efficient Multi-Task Content

Generation

Paper Authors: Yiqi Jing, Jiaqi Zhou, Yiyang Sun, Siyuan He, Ru Huang, Le Ye, Tianyu Jia

Paper Affiliation: Peking University, Beijing, China

Subcommittee Chair: Huichu Liu, Meta, Sunnyvale, CA, Digital Circuits

CONTEXT AND STATE OF THE ART

• Diffusion models have rapidly expanded into many content-generation tasks, e.g. 3D scenes or video, and deliver exceptional

performance. Due to the large number of denoising steps, generating content using such models consumes significant

latency.

• Compute-in-memory (CIM) accelerators improve computational performance thanks to efficient array-level parallelization, but

their use for practical image or multi-task diffusion models remains limited due to dynamic memory bandwidth requirements,

suboptimal CIM macro utilization, and compute-intensive consistency operations of emerging diffusion models.

TECHNICAL HIGHLIGHTS

• Peking University reports a CIM-based accelerator for diffusion models targeting image, 3D, and video generation

tasks, integrating 3Mb eDRAM storage, 864kb digital-CIM SRAM and a RISC-V CPU. The 22nm chip achieves

60.81TFLOPS/W at 180MHz and 0.6V for state-of-the-art diffusion models on customized hybrid BF16-W4A8

precision.

• A dynamic bandwidth-aware memory partition scheme is developed with on-chip eDRAM storage to optimize CIM utilization

and reduce external memory access, achieving a 2.68 performance gain with optimized hardware utilization.

• A bitline-segmented CIM cluster is designed with reuse-aware weight reordering, enhancing macro energy efficiency by

2.17. A hierarchical consistency optimization flow is presented to minimize frame/pixel-level operations and offers 3.71

speedup.

• Multiple content generation tasks are evaluated using state-of-the-art diffusion models, such as SD-v1.5, Wonder3D, and

SVD, based on customized hybrid BF16-W4A8 quantization The 22nm chip achieves 60.81TFLOPS/W at 180MHz and 0.6V.

APPLICATIONS AND ECONOMIC IMPACT

• The contributions in the demonstrated compute-in-memory-based diffusion model accelerator can overcome the memory

bottleneck and further advance energy-efficient image/video generation tasks.

• Improved performance enables running additional consistency operations between views and frames for smooth rendering in

generative 3D and video applications.

Trends

ISSCC 2020

TRENDS

ISSCC 2025

TRENDS

Conditions of Publication

PREAMBLE

The Session Overviews and Highlights to follow serve to capture the context, highlights, and potential impact, of the papers to be

presented in each Session at ISSCC 2025 in February.

OBTAINING COPYRIGHT to ISSCC press material is EASY!

You may quote the Subcommittee Chair as the author of the text if authorship is required.

You are welcome to use this material, copyright- and royalty-free, with the following understanding:

o That you will maintain at least one reference to ISSCC 2025 in the body of your text, ideally retaining the date and

location. For detail, see the FOOTNOTE below.

o That you will provide a courtesy PDF of your excerpted press piece and particulars of its placement to

lcfujino@aol.com and shahriar@ece.ubc.ca.

FOOTNOTE

• From ISSCC’s point of view, the phraseology included in the box below captures what we at ISSCC would like your readership

to know about this, the 72nd appearance of ISSCC, on February 19th to February 20th, 2025.

This and other related topics will be discussed at length at ISSCC 2025, the foremost global forum for new

developments in the integrated-circuit industry. ISSCC, the International Solid-State Circuits Conference,

will be held virtually on February 16 - February 20, 2025

ISSCC Press Kit Disclaimer

The material presented here is preliminary.

As of November 5, 2024, there is not enough information to guarantee its correctness.

Thus, it must be used with some caution.

HISTORICAL TRENDS IN TECHNICAL THEMES

ANALOG SYSTEMS

ANALOG SUBCOMMITTEE

POWER MANAGEMENT SUBCOMMITTEE

DATA CONVERTERS SUBCOMMITTEE

Analog – 2025 Trends

Subcommittee Chair: Viola Schaffer, Texas Instruments Deutschland GmbH, Freising, Germany

As in previous years, ISSCC 2025 continues the focus on energy-efficient precision. New analog technique combinations enable

improvements in the performance of oscillators, amplifiers, references, sensor interfaces, and other circuits.

Significant advances in the energy-efficient stabilization of RC oscillators are disclosed. Figure 1 shows oscillator temperature stability

vs. energy per cycle of an RC oscillator among others published over the last 16 years. By using a reference replica to continuously and

dynamically cancel both comparator offset and delay, the RC oscillator achieves a TC of 9.83ppm/°C, limited by resistor temperature

drift, at 0.4pJ/cycle.

Disclosures on crystal oscillators include low-complexity analog methods for controlling conduction angle to reduce crystal-oscillator

power dissipation to sub-nano-Watt levels.

Figure 1: Trends in RC oscillator stability and energy per cycle.

The same theme of energy-efficient realization of high precision circuits prevails in amplifiers. A boosted audio amplifier uses a single-

inductor buck-boost power stage to directly transfer power from battery to speaker, potentially mitigating the losses associated with the

traiditonal two-stage boost-converter-Class-D combination. Other amplifer techniques focus on improving the bandwidth-power efficiency

and temperature stability of precision amplifiers.

0.1

100

1000

10000

100000

0.100 1.000 10.000 100.000 1,000.000

Stability (ppm/oC)

Energy/Cycle (pJ)

ISSCC VLSI

CICC ESSCIRC

ASSCC ISSCC2025

The theme extends to interfaces for a variety of sensors. Figure 2 shows the performance of two temperature sensors presented at

ISSCC 2025, among other published temperature sensors. Precision and low-power circuit techniques enable an NPN-based

temperature sensor to achieve an 80fJ∙K2 resolution FoM and a 3σ inaccuracy of ±0.1°C, and a wire-metal-based temperature sensor

to achieve a 45fJ∙K2 resolution FoM and a 3σ inaccuracy of ±0.2°C. These represent new levels of combined energy efficiency and

absolute accuracy in CMOS temperature sensors.

In capacitive sensor interaces, time-domain signal processing, two-step conversion, and continuous-time signal chains enable

improvements in performance for a diverse range of applications. A capacitance-to-digital converter (CDC) combines a period-

modulation-SAR front-end with a SAR-ADC backend to achieve 14.5b ENOB and 6.1µs conversion time across a 1000× scalable full-

scale range. Another CDC combines a SAR capacitance-to-voltage-converter front-end with a continuous-time highpass noise-shaping

SAR-ADC backend to achieve a 14.5fJ/conversion step FoMW at 5.12pF full-scale range. A third CDC combines a SAR front-end with a

controlled-oscillator-based 2nd-order delta-sigma modulator backend to achieve an 18.2fJ/conversion step FoMW at 1.2pF full-scale

range. A fourth CDC combines a capacitance-to-phase-converter front-end with a phase-domain delta-sigma modulator to achieve a high

degree of immunity to EMI in floating-target precision-displacement-sensing applications.

In other sensor interfaces, duty cycling and dynamic biasing improve power efficiency and scalability. A MEMS-gyroscope paritially duty-

cycles key circuit blocks to minimize standby power dissipation, while enabling sufficiently fast wake-up for always-on applications. A

continuous-time delta-sigma modulator uses a frequency-controlled biasing method to enable a 225× adjustable conversion rate while

maintaining its signal-to-noise ratio. Finally, to serve increasing system-level speed requirements, techniques are presented for improved

high frequency power-supply rejection of voltage references, for a temperature- and aging-stable frequency management module and

for high speed track-and-hold circuits.

Figure 2: Trends in Resolution FoM of temperature sensors.

1.00E-06

1.00E-05

1.00E-04

1.00E-03

1.00E-02

1.00E-01

1.00E+00

1.00E+01

1.00E+02

1.00E+03

2014 2016 2018 2020 2022 2024

Resolution FOM (nJ·K2)

Year

BJT

Res

ISSCC 2025

Power Management – 2025 Trends

Subcommittee Chair: Bernhard Wicht, Leibniz University, Hannover, Germany

Integrated power management is an indispensable building block in all electronic devices and has become one of the most critical parts

of ever-demanding systems, ranging from high-performance computing, 5G/6G communications, energy harvesting, wireless power

transfer, automotive and various other consumer and industrial applications. The diverse applications come with a large variety of

input/output voltage and current requirements for the integrated power management units that are both challenging and very exciting for

this unique area of solid-state circuits. For example, to power high-performance processors and AI/ML accelerators at the sub-1V core

voltage levels at the power up to 1-3 kW, the integrated power regulators need to convert from very high voltages (~48V) to reduce the

input distribution loss. Supply modulators for power amplifiers in 5G/6G transmitters need to regulate the outputs with near-instantaneous

changes in voltage and current levels, while other applications like automotive require extreme reliability and careful design to minimize

electromagnetic interference (EMI). Some integrated power converters must transfer power and/or signals across isolation boundaries,

and yet others must efficiently process and recover energy from intermittent low-power transducers. With this diversity, integrated power

management needs to be custom designed to specific applications and requirements. While this diversity makes it harder to evaluate the

quality of a certain design, common key performance indicators (KPIs) for integrated power converters are high power and/or current

density, high efficiency (minimal power loss), small overall footprint, and low cost.

In order to meet the wide range of challenges associated with the vast diversity of

applications and input and output requirements, researchers in integrated power

management need to investigate a large set of technologies from converter topologies,

active and passive devices, to methods of integration. This is well reflected in the

submissions and accepted papers at ISSCC in every previous year and also this year.

Figure 1 and 2 on the right illustrate this trend on the submitted and accepted papers to

this year’s ISSCC. There are relatively small shifts of percentage among different

converter categories from submitted papers to accepted papers because of the large

difference between the numbers of submissions (124) and accepted papers (30) with a

competitive ~24.2% acceptance rate.

Notably, the diversity of topics and their relative shares of the pool stay relatively

consistent, in which hybrid converter stands out to hold the largest share. This is

evidence of an apparent increase in interest from both academia and industry to combine

and leverage the benefits of both inductor and capacitor and low-voltage switches to

tackle the challenges in power delivery, particularly in simultaneously achieving high

power density, high efficiency, and large input-output voltage difference. Following

publications from previous years, hybrid converter papers in this year continue to explore

new circuit designs and techniques to address a wide range of input-output voltages,

provide more output voltages, mitigate the voltage and current stresses for inductors

such that size reduction is possible, balance flying capacitor voltages, or reduce output

voltage ripples and increase transient responses, etc.

It is no doubt a daunting task to evaluate and fairly compare the performance among the converters, especially when they are designed

for different applications with different sets of specifications. However, the common key performance indicator (KPI) of efficiency versus

power density (the higher the better for both) is generally agreed among experts in the field. This metric is used to compare the

performance of hybrid converter papers accepted to ISSCC 2025 and in previous years in Fig. 3 and 4 on the right. Power density is a

good metric, but only fair in comparing converters with the same voltage conversion ratio (VCR: the ratio between the input and output

voltages, Vin/Vo) and the same maximum input voltage. Higher input voltages and higher VCR would make it much more difficult to

achieve the same efficiency at the same power density. Figures 3 and 4 show that ISSCC 2025 papers continue to push the limit of

converter designs for higher efficiency and higher power density at higher VCRs and higher input voltages. While it is more difficult to

apply the same metric to papers in the other categories, similar considerations for high efficiency, small overall footprint, and low cost

are used for evaluation. ISSCC attendees are advised to pay close attention

to these metrics to assess certain papers and techniques at the conference.

It is of particular interest to the members of technical program committee

(TPC) in power management (PM) as well as ISSCC TPC at large to see the

trend in these topics over the year, presented in Fig. 5. While the 3-year data

have a good correlation with the submission numbers and interests, the exact

number in each topic and fluctuations over the year also depends on the

quality of works and paper submissions, the diversity of PM TPC members

and their possible charging perspective over the year. With that being

considered, the four notable trends in the last 3 years of integrated power

management are: 1) Power management will stay very diverse, 2) Hybrid

converter is the most popular, followed by wireless power transfer (WPT) and

energy harvesting (EH), 3) Supply modulator becomes more important, 4)

Isolated power converter draws more interest, and 4) LDO research is still

interesting.

Aligned with the interests and trend to increase input voltages to reduce the

input distribution loss while keeping the cost low for fabrication, we see an

undeniable favor of higher-voltage Bipolar-CMOS-DMOS (BCD) processes as

reported in Fig. 6. The researchers are spending a lot of effort in addressing

one of the most challenging issues in the industry today: achieve the KPIs at

a higher input voltage level.

One of the most important emerging technologies that is expected to dominate

high-voltage power converters is GaN power IC. However, in the last 3 years,

GaN power ICs have struggled to gain a larger foothold at ISSCC with the

number of accepted papers limited to three or below each year. This reflects the early

stage of the technology where getting access to a GaN process and fabrication is a big

challenge in itself. Another headwind for this direction is the challenge to prove its

superiority compared to a BCD process, e.g., 180nm BCD, in a practical implementation

even when GaN power switches can theoretically offer 10X-100X higher FOMs (RdsQg)

compared with the BCD counterpart. In addition, GaN device experts are still trying to

improve the overall yield and reliability of the process and GaN devices. Regardless,

there is a good sign in the industry with the superior performance promise and and

research synergy in academia that GaN power IC will still keep its foothold, even still at

of the relatively small number of publications each year, waiting for a larger increase in

a near future when the key drawbacks of fabrication access, yield, and reliability are

alleviated.

One the side of low voltage but extremely high output current delivery for high-performance computing, this ISSCC 2025 will exhibit the

first ever complete fully-integrated switching converter in a 3nm FinFET process that includes zero current detection, and soft-switching

techniques to enable the operations in both discontinuous conduction mode and continuous conduction mode. The converter utilizes a

next-generation package integrated coaxial magnetic integrated inductor (CoaxMIL) to achieve a remarkable output current density of 55

A/mm2.

The readers of this article are advised to also watch the highlights and full contents of the four power management sessions at ISSCC

2025 to experience, to the fullest, the diversity, the breadth, and the depths of one of the most popular and important areas of the solid-

state circuits society today.

Data Converters – 2025 Trends

Subcommittee Chair: Jan Westra, Broadcom, Bunnik, The Netherlands

Achieving high resolution at high sampling rates has been the driving force behind creativity and new architectural advancements in

wireline and wireless applications. This year at ISSCC, high-speed ADCs with resolutions exceeding 12 bits and conversion speeds

beyond several GHz are showcased, demonstrating remarkable energy efficiency. New architectural choices, circuit innovations, and

calibration techniques are pushing the boundaries of performance. Challenges in advanced nodes are addressed and innovative

parallelization in pipeline SAR ADCs are introduced. The use of time-domain and ring oscillator circuits is leveraged to maximize the

converter speeds, taking full advantage of technology scaling. Various noise-shaping techniques and SAR/Pipe-SAR designs are

explored, achieving over 180dB Schreier FoMs and more than 90dB SNDR for high-fidelity applications.

The three figures below represent traditional metrics that capture the innovative progress in ADC design. The first figure plots power

dissipated relative to the Nyquist sampling rate (P/fsnyq), as a function of signal-to-noise and distortion ratio (SNDR), to give a measure of

ADC power efficiency. Note that a lower P/fsnyq metric represents a more efficient circuit on this chart. For low- to medium-resolution

converters, energy is primarily expended to quantize the signal; thus the overall efficiency of this operation is typically measured by the

energy consumed per conversion and quantization step. The dashed trend-line represents a benchmark of 1fJ/conversion-step. Circuit

noise becomes more significant with higher-resolution converters, necessitating a different benchmark proportional to the square of the

signal-to-noise ratio, represented by the solid line. Designs published from 1997 to 2023 are shown in circles. ISSCC 2024 designs are

shown with blue stars and ISSCC 2025 designs are shown with orange stars.

The second figure plots signal fidelity vs. the Nyquist sampling rate normalized to power consumption. At low sampling rates, converters

tend to be limited by thermal and flicker noises, independent of the sample rate. Higher speeds of operation present additional challenges

in maintaining accuracy in an energy-efficient manner, indicated by the roll-off vs. frequency in the dashed line. The last ten years have

resulted in an improvement of over 10dB in power-normalized signal fidelity, or a 10× improvement in speed for the same normalized

signal fidelity. At ISSCC this year, hybrid pipelined-SAR and noise-shaping SARs are continuing the trend in the speed vs efficiency

corner of the graph. Time-based conversion of signal and time-domain quantization are becoming an integral part of converters in scaled

nodes. Time-interleaving multiple channels with error correction over process, voltage and temperature (PVT) as well as clock skew

between channels is a necessity in these architectures to achieve robust performance in high-speed conversion rates.

The final figure plots ADC bandwidth as a function of SNDR. Sampling jitter or aperture errors deteriorate the ADC resolution and signal

bandwidth. In recent years we have seen designs with aperture errors below 100fsrms published at ISSCC.

Figure. 1. Power dissipated relative to the Nyquist sampling rate (P/fsnyq) versus SNDR.

1.E-01

1.E+00

1.E+01

1.E+02

1.E+03

1.E+04

1.E+05

1.E+06

1.E+07

10 20 30 40 50 60 70 80 90 100 110 120

P/fsnyq [pJ]

SNDR @ fin,hf [dB]

ISSCC 2025

ISSCC 2024

ISSCC 1997-2023

VLSI 1997-2024

FOMW=1fJ/conv-step

FOMS=185dB

Figure. 2. Schreier figure of merit versus Nyquist sampling rate.

120

130

140

150

160

170

180

190

200

1.E+02 1.E+04 1.E+06 1.E+08 1.E+10 1.E+12

FOMS,hf [dB]

fsnyq [Hz]

ISSCC 2025

ISSCC 2024

ISSCC 1997-2023

VLSI 1997-2024

Envelope

Figure. 3. Input bandwidth versus SNDR.

1.E+06

1.E+07

1.E+08

1.E+09

1.E+10

1.E+11

10 20 30 40 50 60 70 80 90 100 110 120

fin,hf [Hz]

SNDR @ fin,hf [dB]

ISSCC 2025

ISSCC 2024

ISSCC 1997-2023

VLSI 1997-2024

Jitter=1psrms

Jitter=0.1psrms

HISTORICAL TRENDS IN TECHNICAL THEMES

COMMUNICATION SYSTEMS

RF SUBCOMMITTEE – WIRELESS SUBCOMMITTEE

WIRELINE SUBCOMMITTEE

RF – 2025 Trends

Subcommittee Chair: Brian Ginsburg, Texas Instruments, TX

The RF papers to be presented at ISSCC 2025 demonstrate record-setting advancements in phase-locked loops (PLLs), voltage-

controlled oscillators (VCOs) and multipliers, RF power amplifiers (PAs), low noise amplifiers (LNAs), phase shifters, and transmitters

(TXs). These innovations are driven by emerging requirements in 5G and 6G communications, the Internet of Things (IoT), radars, and

imaging at RF, mm-wave, and THz frequencies. This document provides a highlight of these trends.

Frequency Synthesis: At ISSCC 2025, papers introducing new PLL concepts for generating RF and mm-wave frequency carriers will

be presented. These papers feature concept demonstrations with record-setting achievements in low jitter, low-power consumption, and

settling time. These achievements extend the number of publications with jitter-power FoM below -250dB for both fractional-N and integer-

N PLLs, as shown in Fig. 1.

For instance, a cascaded 4.7-to-5.7GHz fractional-N PLL adds a fractional divider between its two stages. By sharing the DSM of the two

fractional dividers, they are able to cancel their quantization error, achieving 96fsrms jitter, a -70.6dBc fractional spur, and a -247.1dB FoM.

Additionally, a cascaded 5.5GHz fractional-N RO-based digital PLL achieves <290.8fsrms jitter with a 10% variation over PVT and -

237.6dB FoM by utilizing a TVC and ADC to maintain low-quantization-noise phase-noise quantization and voltage-domain feed-forward

noise cancellation. Moreover, a low-voltage 10.4-to-11.8GHz fractional-N PLL achieves 73.8fsrms jitter and -251.3dB FoM by utilizing a

hybrid cascaded DPD to compensate for the INL of a low-voltage DTC and sampling PD. A 10.1GHz fractional-N digital PLL minimizes

DTC static and dynamic delays, exploiting a quantization error compensating BBPD and attains 65fsrms jitter and -252.2dB FoM. Another

fractional-N PLL reaches 38.5fsrms and -254dB FoM by employing a multipath feedback technique based on multiple weighted CP

components and re-timed feedback paths to reduce DSM quantization noise and mitigate in-band noise. Furthermore, a 27GHz fractional-

N subsampling PLL features a polarity-reversible subsampling PD to reduce the DTC range. The automatic TDC gain calibration of the

FLL enables a <2µs locking time. This PLL exhibits 57.9fsrms jitter and -249.7dB FoM.

Two papers target fast-hopping LO generation. One paper describes a 2.25-to-2.54GHz digital PLL-based GFSK modulator that utilizes

an ultra low-power DCO and fast-converging gain estimation to achieve a 2.1% FSK error with a sub-20µs settling time and power

dissipation of only 380µW. The other paper discusses a 53.2-to-64.9GHz SSB-mixer-based integer-N PLL with a <1ns settling time. It

addresses mixer-induced spurs with an injection-locked oscillator filter and background I/Q calibrations, resulting in a -56dBc worst-case

spur level and 60.3fsrms jitter.

Additionally, two PLLs exploit low-PN oscillators to enhance their performance. One PLL uses an adaptive technique to automatically

tune the common-mode resonance of a harmonic-shaping DCO integrated into a 4.75GHz integer-N digital PLL, achieving a 45.8fsrms

jitter and -257dB FoM. The other PLL incorporates a series-resonance oscillator in a 13GHz charge-pump integer-N PLL and, with the

assistance of a time-amplifying PFD and a sampling-based filter for the reference spur, attains 15.8fsrms jitter and -257dB FoM.

One paper discusses the co-designs a PLL and XO. The XO provides a clean reference to the PLL, while the PLL provides injection

timing with a narrow angle for the pulse XO driver. This integer-N PLL operates at 4.6GHz and achieves 63.3fsrms jitter and -255.2dB

FoM, including the XO power and noise.

Additionally, another paper presents an 11-to-16.4GHz FMCW digital PLL that proposes a DPD technique to compensate for the DCO

non-linearity. It achieves a 0.043% rms frequency error under 3.4GHz/μs slope and a 5.32GHz bandwidth.

Fig. 1. PLL trends. The ISSCC 2025 results are identified in red.

Waveform shaping and series-resonator-based oscillators: The ISSCC 2025 will showcase five new contributions to RF and mm-

wave oscillators. This year, three waveform shaping oscillators will be presented. The first is an 18.5-to-23.6GHz quad-core class-F2,3

oscillator, which achieves a peak FoM of 193dBc/Hz and a <250kHz 1/f3 PN corner without 2nd/3rd harmonic manual tuning. The second

oscillator covers a frequency range from 9.05 to 37GHz (121%) and utilizes a new magnetic mode switching to achieve its high tuning

range. It attains >190.7dBc/Hz FoM at a 10MHz offset frequency through a tuning-free octave BW common-mode resonator. The last

oscillator operates within the 47.3-to-58.4GHz range and is a quasi class-E Colpitts oscillator based on a two-port resonator, achieving

a peak FoMT of 198.8dBc/Hz. The remaining two oscillators are both CMOS series-resonator based. One utilizes a mixed-mode resonator

consisting of a series-LC tank magnetically coupled with a parallel-LC tank and a single-inverter active core. This 8.1-to-9.9GHz oscillator

achieves -124 to -128.7dBc/Hz PN at a 1MHz offset frequency while consuming <36.3mW. The other oscillator operates within 7.65-to-

9.13GHz range and is a differential CMOS series-based resonator with a pole convergence technique, reaching -150.5 to -153dBc/Hz

PN and peak FoM of 190dBc/Hz at a 10MHz offset frequency.

Front-End Circuits for High-Performance Transceivers: ISSCC 2025 features eleven new contributions to RF front-end circuits, which

includes two LNA papers, five PA papers, and four TX (including RF DAC) papers. A highly linear 65nm-CMOS 27GHz LNA employing

Doherty active load modulation is presented to improve its blocker tolerance. It achieves minimum NF of 2.5dB and IIP3 of -2.5dBm. A

low-noise active bandpass filter (LNA-BPF) based on a cascode amplifier with a tunable band-stop filter to create the passive feedforward

cancellation is proposed in a 22nm FDX process. It shows the minimum NF of 3.6 to 4.2dB across the 22-to-27GHz band and sharp

filtering characteristics. A PA architecture that combines the merits of Doherty and switchless class-G (SLCG) PAs is proposed in a

250nm GaN process. Over 38% 6dB back-off drain efficiency and 29.1-to-31dBm average output power (Pavg) for a 100MHz 64-QAM

LTE signal are demonstrated over a 1.35-to-7.6GHz frequency range. A VSWR-resilient PA that uses a proposed LC-complex cascode

architecture is presented in a 22nm SOI CMOS technology. It achieves the Pavg of 9.5 and 6.8dBm for50Ω and 4:1 VSWR load

conditions for a 39GHz 100Mbaud 64-QAM signal. Time-modulated PA chain architecture supporting digitally controlled beamforming

with high back-off efficiency is proposed in a 0.18µm SiGe process. It shows the 6dB back-off efficiency of 16% and enables ±30° beam

steering without phase shifters at 24GHz. A 24-to-29GHz Doherty PA using 3-stacked amplifier cores with an operational amplifier (OPA)-

based bias scheme is presented in a 22nm SOI CMOS technology. It achieves 23-to-24.1dBm Psat, 6-to16.8dBm Pavg for 800Mbaud

64-QAM OFDM signal and 1000h temperature reliability. A 56-to-64GHz PA with scalable matched-zone-expanding radial power

combining, which uses the combination of voltage and current combining, is presented in 40nm bulk CMOS. The PA demonstrates Psat

of 30.2dBm and OP1dB of 28.9dBm. A 1b digital polar transmitter employing a CORDIC-less polar encoding scheme that converts multibit

I/Q input into a 1b AM and 3b PM signals is proposed in 65nm CMOS. It supports 1.35GHz 50Mbaud 256-QAM modulation with 16.9dBm

output power, 20.2% PAE, and 19.5% system efficiency. A digital 40nm-CMOS PA using an in-cell fast slope-to-phase self-calibration

technique is proposed to improve AM-PM linearity of a switched-capacitor PA (SCPA) operating at high frequency and supporting

modulation signals without DPD. It achieves 27.7dBm peak Pout and 37.8% peak PAE at 4.7GHz and supports 160Mbaud 64-QAM and

80Mbaud 256-QAM signals without DPD. A high-power digital transmitter with dynamic retiming and a glitch-free phase mapper operation

is proposed using 40nm CMOS and LDMOS. Operating at 1.7GHz, the transmitter achieves 43dBm peak output power, 63% peak system

efficiency, and 34.9dBm Pavg with 28.1% system efficiency under 53Mbaud 64-QAM modulation. A 21-to-31GHz reconfigurable

2×8b/2×11b quadrature RFDAC with an impedance-compensation array for invariant impedance and digital pre-modulator for scalable

LO leakage is presented in a 40nm CMOS technology. It supports 2.4Gb/s 256-QAM modulation without DPD. Fig. 2 illustrates Psat vs

frequency trends of other PAs including those that will be presented at the ISSCC 2025.

Components for Beyond 100GHz: ISSCC 2025 introduces several exciting and innovative circuit components operating at D band and

beyond. In particular, two phase shifters at D band, one multiply-by-16 radiation source at 250GHz, one 32 way power amplifier (PA) at

220GHz, and one voltage controlled oscillator (VCO) at 225GHz are showcased. One phase shifter is based on a frequency/switching-

staggered 5-stage reflective-type phase shifter and achieves bi-directional and calibration-free 360° phase shifting in a 110-to-140GHz

band. The phase shifter is implemented in 22nm CMOS SOI and shows 11.25° phase resolution with rms phase and gain errors of 0.64°

to -2.38° and 0.12dB to 0.63dB across the band, respectively. Another phase shifter achieves a maximum output power at 1dB gain

compression (OP1dB) of 3.5dBm at 130GHz by eliminating the variable-gain amplifier (VGA) and replacing it with phase shifters for gain

and phase calibration. This work was implemented in 55nm SiGe and shows 360° phase control in 9° steps across 125 to 170GHz. The

multiply-by-16 radiation source takes advantage of a patterned dielectric matching sheet installed at the back of the chip to enhance the

wave coupling from silicon to air and to replace a silicon lens with a low-cost and planar alternative. This radiator is implemented in 16nm

FinFET CMOS and operates from 232 to 260GHz with a peak radiation power and equivalent isotropic radiated power (EIRP) of 11.1dBm

and 24.5dBm, respectively. A varactor-less VCO is presented that uses multiple coupled line resonators along with switches to improve

the tuning range at 224GHz. The VCO is implemented in 22nm FDSOI and achieves 19.9% tuning range with peak FoMT of -182dBc/Hz.

Finally, large-signal impedance correction is used in a 50nm GaN HEMPT PA to achieve 1350mW/mm power density at 216GHz. This

individual PA is packaged and 32 of them are combined to generate >1W of power at 216 to 226GHz.

Fig. 2. PA trends.

Wireless – 2025 Trends

Subcommittee Chair: Chih-Ming Hung, MediaTek, Taipei, Taiwan

The persistent demands for high data throughput and low power consumption continue to drive the development of wireless integrated

transceivers. The crowded frequency spectrum further motivates interference-tolerant receivers. This year at ISSCC 2025, several

transmitter and receiver papers were selected to showcase the advancements in wireless systems across a wide range of applications,

including LTE, UWB, mm-Wave, and D-band frequency ranges. The key development includes:

1. Towards 6G communication systems: Increasing operation frequency and high-speed data transfer

Communication transceivers for 6G are designed to operate at higher frequencies with a wider bandwidth, such as those in the

frequency range 3 (FR3, 7 to 24GHz), millimeter-wave frequency (30 to 300 GHz) and even terahertz bands, which allow for ultra-

fast data transmission (up to 1 Tb/s) and ultra-low latency. According to the published papers this year, designs for wireless

communication transceivers and relays targeting 6G communication represent a significant portion of the total publications. On

Tuesday, a 29GHz satellite-communication phased-array receiver is presented, achieving a data rate up to 16Gb/s with two circular

polarization beams. Additionally, a D-band active relay transceiver was highlighted for its capability to perform 2-D angle-of-arrival

detection, demonstrating data rates of up to 6Gb/s for none-line-of-sight communication. Figure 1 demonstrates a visual

representation of the trends for data rates accomplished from millimeter-wave to sub-terahertz frequency.

Figure 1: Data-rate trend of mm-wave and sub-THz wireless transmitters and receivers.

2. FMCW radar: Increasing frequency, resolution and range

In the past few years, many 24GHz FMCW radars have been developed for sensing and imaging. These systems typically use single

PLLs to synthesize frequency. In the mm-wave range, 76-to-81GHz FMCW radars are widely used for vehicle distance sensing and

SLAM. These systems often feature frequency synthesizers that use PLLs with multipliers. In recent years, FMCW radar systems

are advancing into sub-THz range, offering expanded bandwidths of tens of GHz, which improve range resolution to the millimeter

level. Achieving such high frequencies often requires long frequency up-conversion chains. Multi-band approaches are also

proposed to enhance bandwidth. Figure 2 illustrates a clear trend, showing a general improvement in resolution from cm-level to

mm-level across the FMCW radar at different bands.

Figure 2: Radar resolution trend of mm-wave and sub-THz FMCW radar transceivers.

24GHz and 77GHz mm-wave radars have achieved transmission distances up to 100 meters. However, sub-THz radar systems

face significant propagation losses, around 28dB/m or more, limiting communication to less than 1 meter. Extending sub-THz

transmission ranges has become a key focus, with high-directivity antennas or arrays being a crucial solution. Recent designs using

AiP technology in D-band phased-array transceivers have shown promise. The accepted ISSCC 2025 paper highlighted in Figure 2

presents a D-band FMCW radar transceiver Antenna in Package (AiP), achieving a transmission range of 16.8 meters, which is one

of the largest values in D-band design, although the demonstrated range resolution is only 13mm.

3. Wide bandwidth, multiband receiver: low power, high linearity, and high blocker tolerance.

In recent years, wide-bandwidth (WB) receivers covering multiple communication protocols have garnered significant attention in

wireless design. These WB receivers not only provide a compact solution for a wide variety of applications but also reduce the overall

complexity of transceiver architecture, becoming the cornerstone of software-defined radio systems. However, the inevitable

presence of jammers imposes significant design challenges on the linearity and blocking tolerance of WB receivers. Figure 3 presents

how in-band IIP3 advanced with increasing receiver bandwidth. In ISSCC 2025, several notable advancements are presented:

● A 0.5-to-3GHz LNTA-based receiver achieving an adjacent IIP3 of 19dBm.

● A 31-to-34GHz mixer-first receiver utilizing a non-uniform time-approximation filter to achieve >65dB blocking rejection.

● A 0.5-to-5GHz full-duplex receiver employing multi-stage all-pass filters (APFs) to achieve >65dB self-interference cancellation

(SIC) for bandwidths greater than 120MHz.

Figure 3: Trend of Receiver IIP3 vs Bandwidth for Papers Published at ISSCC.

Wireline – 2025 Trends

Subcommittee Chair: Thomas Toifl, Cisco System, Thalwil, Switzerland

Ever-increasing bandwidth and performance demands, accelerated by the extensive deployment of AI/ML workloads, are straining

the capabilities of today’s electrical and optical interconnect solutions. To meet the performance needs of applications ranging from

telecommunication networks, to datacenters, to mobile devices, electrical and optical interconnect solutions with higher throughput

and energy efficiency are required. To keep pace, Figure 1 shows that the data rates per pin double almost every 3-4 years across

different IO standards. Figure 2 illustrates the data rates for published transceivers versus process nodes. While taking advantage

of process scaling, designing energy-efficient interconnects with 100-200+ Gb/s data rates face steep challenges of channel loss,

reflection, xtalk and noise. Figure 3 shows energy efficiency vs. channel loss of up to 52dB at the Nyquist frequency. Significant

innovations in energy-efficient analog and digital equalization, xtalk/noise cancellation, low-jitter clocking techniques, heterogeneous,

chiplet-based and co-packaged integration are made to meet the exponential bandwidth growth need.

Papers presented at ISSCC this year include examples of 212.5Gb/s wireline transceivers with 50dB and > 46dB loss compensation

capabilities, an XSR analog RX operating at 224Gb/s, a 212Gb/s re-timer for optical communication and a 32Gb/s 10.5Tb/s/mm

UCIe compliant interface:

1. A 212.5Gb/s DSP-Based PAM-4 Transceiver with 50dB Loss Compensation for Large AI Systems Interconnects in 4nm FinFET.

2. A 2.2pJ/b 212.5Gb/s PAM-4 Transceiver with > 46dB reach in 5nm FinFET.

3. A 1.11pJ/b 224Gb/s XSR Receiver with Slice-Based CTLE and PI-Based Clock Generator in 12nm CMOS.

4. A 353mW 112Gb/s Discrete Multi-tone Wireline Receiver Datapath with Time-Based ADC in 5nm FinFET.

5. A 212Gb/s PAM-4 Retimer with Integrated High-Swing Optical Driver and Chip-to-Module Long Reach Capability of 40dB in

5nm FinFET.

6. A 0.9pJ/b 108Gb/s PAM4 VCSEL-Based Direct-Drive Optical Engine.

7. A 112Gb/s 0.61pJ/b PAM-4 Linear TIA Supporting Extended PD-TIA Reach in 28nm CMOS.

8. A 1.54pJ/b 64Gb/s 16-QAM Intradyne Coherent Optical Receiver in 28nm CMOS.

9. A 50Gb/s Burst-Mode NRZ Receiver with 5-Tap FFE, 7-Tap DFE and 15ns Lock Time in 28nm CMOS for Symmetric 50G-PON.

10. An 8-28GHz 8-Phase Clock Generator using Dual-Feedback Ring Oscillator in 28nm CMOS.

Ultra-High-Speed Wireline and High-Performance Clocking Techniques

With explosive bandwidth demand of data-intensive applications such as artificial intelligence (AI) and high-performance computing

(HPC), 100Gb/s/lane interconnects are transitioning to 200Gb/s/lane to enable 1.6Tb/s Ethernet. Design innovations in equalization

techniques, multi-phase low-jitter clock generation, clock and data recovery (CDR), employing various modulation schemes such as

pulse amplitude modulation (PAM) and discrete multitone (DMT) are imperative to achieve such aggressive data rates with

competitive energy and area efficiency. In Paper 7.1, MediaTek presents a complete DSP-based PAM-4 transceiver at 212.5Gb/s,

implemented in 4nm FinFET and achieved <1e-10 and 2.5e-6 BER over a channel loss of 40 and 50.5dB, respectively. The energy

efficiency of transceiver is 5.3 pJ/b with 1.8pJ/b from DSP. A jitter-cleanup PLL (JCPLL) generates a clean, low-frequency reference

clock to 20-28GHz LC VCO which employs a high-Q inductor and second-harmonic LC filter achieve low phase noise performance.

The entire receiver clocking path is filtered by an LDO with noise canceling path. In Paper 7.2, Marvell demonstrates a 212.5Gb/s

transceiver with analog energy efficiency of 2.2pJ/b and loss compensation of 46dB, fabricated in 5nm FinFET technology. The

current mode transmitter shows an SNDR of 36.1dB and an RMS jitter of 73fs at a 212.5Gb/s data rate. In Paper 7.3, Peking

University presents a 224Gb/s XSR receiver (RX) in 12nm CMOS technology. The RX design, featuring a slice-based CTLE with a

3dB bandwidth of 59GHz, demonstrates a BER of 8.1e-7 over a 13.6dB channel loss with an energy efficiency of 1.11pJ/b. In Paper

7.5, Daegu Gyeongbuk Institute of Science and Technology demonstrates a 112Gb/s ADC-based DMT wireline RX in 5nm FinFET

with 1e-4 BER for 12dB channel loss at 26GHz. The DSP equalizes DMT symbols that are independently bit- and power-loaded

over 63 sub-channels to fit the channel SNR profile.

Low-jitter multi-phase clock generators and CDRs with fast lock time are key building blocks of wireline transceivers. In Paper 7.7,

Peking University presents an analog 50Gb/s burst-mode NRZ receiver in 28nm CMOS for symmetric 50G-passive optical network

(PON) applications. An edge equalization is introduced to enhance lock accuracy under large ISI and poor SNR, achieving a

maximum lock time of 15ns. In Paper 7.10, Fudan University reports an 8-phase 8-to-28GHz clock generator comprising a dual-

feedback ring oscillator injection locked by a delay locked loop. Fabricated in a 28nm CMOS process, the clock generator output

jitter and the maximum phase error are measured less than 40fs and less than 3°, respectively.

Ultra-High-Density Die-to-Die Interconnects

AI/ML-driven growth in computing performance has accelerated system disaggregation into chiplets using organic or advanced

packaging technologies (2D/2.5D/3D). The UCIe (Universal Chiplet Interconnect Express) has been developed to standardize chiplet

interconnects on a package and targets high energy efficiency and high bandwidth density with low latency. In Paper 36.1, TSMC

presents a 32Gb/s/pin UCIe-compliant advanced package D2D interconnect, connected via a 1.7mm silicon bridge. Fabricated in

3nm FinFET technology, each module, comprising 64 transmitter and 64 receiver lanes, achieves 10.5Tb/s/mm beachfront

bandwidth density at 0.6pJ/b. In paper 36.2, Peking University reports a 64Gb/s/wire single-ended simultaneous bi-directional

transceiver for D2D interconnects. A dynamic voltage threshold circuit decouples the bi-directional signal and cancels the crosstalk.

A synchronous clock and reset distribution network ensure precise crosstalk cancellation. Implemented in 28nm CMOS, the

transceiver measures 10.5Tb/s/mm beachfront bandwidth density at 1.21pJ/b and BER of less than 1e-16 on a 3mm shield-less on-

chip channel. In paper 36.3, Cadence demonstrates a 4-16Gbps/pin UCIe 2.5D over 1.4mm interposer D2D interconnect, achieving

5.27Tb/s/mm at 0.29pJ/b in 3nm FinFET. By running 8.192Tb/s of data traffic on 8 links, a 0.29UI eye opening at 1e-15 BER for

16Gb/s/pin is measured. The PHY latency is measured to be 3.5ns.

Ultra-High-Speed Optical Interconnects

As interconnect data rates continue to increase in support of rapidly increasing demand for higher bandwidth in hyperscale

datacenters, the reach of electrical interconnects is becoming shorter, despite significant recent advancements. Datacenter

interconnects combine direct-attach copper cables for intra-rack connectivity, pluggable optical modules for reach longer than a few

meters and coherent optical transceivers for long-haul telecommunication applications. Co-packaged optics (CPO) can address the

electrical interconnect scalability and reach/power bottleneck by closely integrating optical components with a computing system. In

Paper 36.4, Intel presents a 108Gb/s PAM-4 VCSEL-based direct-drive optical engine targeting CPO applications, achieving 0.9pJ/b

and 2.4e-4 BER. A VCSEL driver with linear complex-zero CTLE and a linear differential TIA with an active complex-zero CTLE,

fabricated in 22nm FinFET, are introduced to enable >100Gb/s PAM-4 operation. In Paper 36.5, Southern University of Science and

Technology reports a 112Gb/s PAM-4 TIA in 28nm CMOS, featuring a regulated active-input-termination and interpolation-based

single-ended-to-differential (S2D) converter, that achieves a long 0.2inch PD-TIA reach at 0.61pJ/b. In Paper 36. 7, University of

Illinois at Urbana-Champaign demonstrates a QAM-16 analog coherent receiver, fabricated in 28nm CMOS, which recovers 64Gb/s

data at 1.54pJ/b with a BER better than 1e-10. By using the weighted sum of phase errors extracted from the inner and outer symbols

of the QAM-16 constellation, the phase detector range is extended, and transition density is doubled, enabling a robust carrier phase

recovery. In Paper 36.9, Marvell presents a 212Gb/s PAM-4 re-timer with the integrated optical driver at 4.4pJ/b and chip-to-module

(C2M) capability of 40dB loss compensation at 2.76pJ/b, fabricated in 5nm FinFET technology. The optical driver achieves a swing

of 3Vppd with a bandwidth of 54GHz and a TDECQ of 2.04dB when driving a Mach-Zehnder Modulator. The C2M interface has a

measured BER of 1.8e-10 over a 40dB-loss channel.

Concluding Remarks:

As aggressive bandwidth scaling of wireline interconnects continues, meeting system power, cost, reliability and thermal budget

become extremely challenging. Relentless innovations in systems, circuit architectures and integrated photonics as well as advances

in packaging, interconnect topologies, and process scaling play a pivotal role in overcoming the limitations and pushing the

boundaries beyond today’s interconnects. ISSCC 2025 will highlight the most promising of these emerging technologies for electrical

and optical interconnects.

Figure 1: Per-lane data rate vs. year for a variety of common I/O standards.

Figure 2: Data-rate vs. process node and year.

100

1000

110 100

Data Rate [Gbps]

Process Node [nm]

2009

2010

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

2021

2022

2023

2024

2025

Figure 3: Power efficiency vs. channel loss and year.

0.10

1.00

10.00

100.00

1000.00

020 40 60

Power Efficiency [mW/Gbps]

Channel Loss at Nyquist [dB]

Other

ISSCC 2025

HISTORICAL TRENDS IN TECHNICAL THEMES

DIGITAL SYSTEMS

DIGITAL ARCHITECTURES & SYSTEMS SUBCOMMITTEE

DIGITAL CIRCUITS SUBCOMMITTEE

MEMORY SUBCOMMITTEE

SECURITY SUBCOMMITTEE

Digital Architectures & Systems (DAS) – 2025 Trends

Subcommittee Chair: Rahul Rao, IBM, Bangalore, India

This year continued the trend of fewer dies in more stable packaging. Chiplet-based disaggregation continues to help keep down costs

through yield improvements. Technology scaling packed more transistors within the same die area and microarchitectural improvements

ensured those transistors translated to gains in instruction per cycle (IPC) and accelerating AI workloads. Total cache sizes continue to

increase from SRAM density improvements, further lifting application-level performance. Robustness and reliability in processors is

improved through metal capacitor short detection, cache RAID protection, and fine-grained self-healing of memories. An emphasis on

I/O and data processing units is also seen in this year's papers increasing throughput for network and edge server use cases.

Figure 1: Core-count and die-count trends (red diamond designates multi-chip module).

Figure 2: Chip-complexity scaling trends in terms of transistor count and total cache-size (red diamond designates multi-chip module).

Edge computing saw an increased focus on AI for video applications, including multiple papers on super resolution (SR) for low-

bandwidth, high-resolution streaming applications and Gaussian splatting for real-time immersive experiences. While super resolution is

not new for this year, there is continued focus in pushing SR energy efficiency to achieve high resolution and frames-per-second, while

maintaining low bandwidth usage. Real-time photorealistic rendering in Gaussian splatting processors also made a strong appearance

this year. Together, both technologies make strides towards real-time video in virtual and augmented reality, embodied AI, and immersive

display applications. Finally, safety of autonomous vehicle and robotics applications is improved at edge through small-object detection

in high-resolution videos.

Figure 3: Application-processor trends in handheld devices.

With deep neural networks (DNNs) now extensively deployed across a wide range of application domains and market segments, including

wearables, mobile devices, edge and servers, DNN models are also continuously evolving to meet diverse application demands. These

DNN models vary significantly in size and computational complexity, creating a growing demand for highly efficient and high-performance

neural-network computing chips. The recent trend toward focusing on machine-learning accelerators for applications such as rendering,

robotics, and image generation continues to evolve, expanding the range of model architectures, including convolutional neural networks,

transformers, diffusion models, and spiking neural networks (SNNs) as shown in Figure 4.

Hardware accelerators provide critical optimizations that balance energy efficiency, performance, and accuracy across various DNNs.

Key system-level metrics remain energy-per-inference (or per-training-example), inferences/second (or training-examples/s), utilization,

and operations per mm² on a specific task at a given inference (or final trained) accuracy for both dense and sparse levels of sparsity.

With the rise of generative AI, accelerators are adapting to meet the growing computational demands of tasks like image generation and

large language models. These workloads require efficient memory handling and high computational throughput, driving advancements

in hardware designs aimed at improving overall efficiency while maintaining accuracy. A major focus this year is on reducing external

memory access (EMA), a key factor in improving both performance and energy efficiency, particularly for large models such as

transformers used in generative AI. Techniques like model compression and on-chip memory enhancements have become essential in

minimizing EMA-related bottlenecks.

Figure 4: Various parameters impacting low-level and system-level benchmarking metrics.

At ISSCC, several key emerging trends in machine learning and AI are shaping the landscape:

1. Energy Efficiency and Performance: A major focus this year is on improving energy efficiency across various machine learning

tasks, particularly in generative AI and large models like transformers. Metrics such as energy per inference and inferences per

second remain central to evaluating accelerators, with this year’s designs pushing the boundaries of efficiency through

techniques like optimized scheduling, depth-first processing, and compute-memory integration. These improvements allow

hardware to handle increasingly complex models while keeping energy consumption low, driving better performance at reduced

power levels.

2. Memory Efficiency and Model Compression: As model sizes grow, reducing external memory access (EMA) has become

more critical than ever. This year’s papers showcase significant advancements in algorithmic model compression and memory-

efficient designs. Techniques like quantization—specifically binary and ternary—and optimized bit allocation are helping to

minimize EMA, leading to lower energy consumption and more efficient memory usage. These strategies are essential for

managing the high data throughput required by large-scale applications like generative AI.

3. Domain-Specific and Hybrid Accelerators: Tailored, domain-specific accelerators are increasingly being used for applications

ranging from real-time processing to 3D point-cloud analysis. This year’s designs emphasize hybrid architectures that combine

multiple model types, such as CNNs and transformers, to optimize both performance and energy efficiency. Techniques like

layer fusion, hybrid attention mechanisms, and task-specific pruning are enabling these accelerators to handle the diverse

computational needs of modern machine learning workloads effectively.

4. Exploiting Sparsity and Hardware-Software Co-Design: Sparsity exploitation continues to be a key method for reducing

computational complexity and improving efficiency. This year’s designs leverage sparsity-aware techniques, such as

binary/ternary quantization and look-up tables, to avoid unnecessary computations. Hardware-software co-design is also

becoming more prevalent, allowing accelerators to optimize data flow and memory use in an integrated manner, resulting in

highly efficient processing across various tasks.

Figure 5: Heterogeneity in deep-learning accelerators.

Integer

Floating-point

Logarithmic

Custom Arbitrary Bases

Mixed-Precision

Digital CMOS

Compute-In-Memory

(SRAM, RRAM, eDRAM)

Analog/Mixed signal

CNN core

RNN core

Depthwise-Separable core

Transformer core

Heterogeneous

SoC

Sparse accelerator

Dense accelerator

With the increased diversity in machine-learning accelerators aimed at different applications and sparsity, a fair comparison across these

designs for true system-level benchmarking metrics (e.g. energy/inference or inferences/s) is challenging. Hence, we look at reported

metrics of operations/s and energy/operation within the neural network where possible. Figure 6 plots the evolution of both energy

efficiency and area efficiency (throughput-per-unit-area) over the past few years. Though it is difficult to draw definitive trends from these

graphs, it does indicate a drive towards higher efficiency at higher bit-precisions. It should be noted that these metrics (TOPS/W and

TOPS) are highly dependent on the topology being used, and the sparsity of the associated data set. While attempts have been made

to establish a benchmarking methodology that can properly account for the application context [1], its adoption has been limited. In the

meantime, clever combinations of sparsity, variable precision and in-memory computing technologies are continuing to enhance deep-

learning processor efficiency and throughput.

Figure 6: Evolution in energy efficiency (TOPS/W) and throughput per unit area (TOPS/mm2) of ML inference processors.

[1] G. W. Burr, S. Lim, B. Murmann, R. Venkatesan and M. Verhelst, "Fair and Comprehensive Benchmarking of Machine

Learning Processing Chips," in IEEE Design & Test, vol. 39, no. 3, pp. 18-27, 2022.

Digital Circuits – 2025 Trends

Subcommittee Chair: Huichu Liu, Meta, Sunnyvale, CA

The escalating demand for energy-efficient platforms, from low-power wake-up sensors to AI accelerators, is fueling ongoing innovations

in CMOS digital design. This year's ISSCC papers demonstrate continued progress in addressing key trends, including the need for co-

optimized design and technology, overcoming memory bottlenecks in large language model accelerators, and expanding silicon

technology into novel form factors like textiles. Additionally, the conference highlighted advancements in clocking, power management,

3D heterogeneous integration, and the growing use of runtime regression and machine learning techniques in digital circuits.

Digital Clocking Circuits for Low-Jitter Applications: Large multi-domain SoCs require efficient generation of multiple clock domains

capable of low-voltage operation, and fine-grained output frequency control with low jitter. Figure 1 presents the trade-off between power

and jitter captured into a composite figure of merit (FoM) for different types of clock generators. Frequency output dividers (FODs)

continue to grow in popularity, each producing fractional clock frequencies from a single high-frequency clock source. Digital-to-time

converters have been used to minimize spurious tones (spurs) created by reference sampling or fractional division in multi-modulus

dividers, and this year’s ISSCC publications continue this trend. Circuit imperfections the form of gain and integral non-linearity (INL) are

suppressed, reducing spurs while maintaining energy efficiency and realizing competitive FoMs (Figure 2).

Figure 1. Advances in pareto-optimal performance between jitter and power in digital clock generators

along with key figure of merit (FoM) across recent years, defined as: FoM = 10  log10{ (JitterRMS/1s)²  (Power/1mW) },

showing fractional ratio clock sources now reaching the same performance as integer ratio clock sources.

Figure 2. Joint improvement of spur rejection and FoM in recent digital clock generators:

spur suppression is becoming the main challenge to improve jitter and reduce FoM.

Integrated voltage regulators (VRs) and power-delivery designs: The relentless pursuit of power-constrained performance in scaled

technologies has intensified challenges in power delivery and point-of load voltage regulation. Supply noise and VR transient response

remain significant contributors to system inefficiency. Moreover, 3D integration introduces additional constraints on power distribution

networks (PDNs) and decoupling capacitance resources. To address these challenges, this year's ISSCC papers showcase control

techniques aimed at enhancing transient response and simplifying control of increasingly complex multi-domain hierarchical power

delivery architectures. Run-time control for power management continues to gain widespread adoption. Notable techniques include

adjustment of supply voltage and droop mitigation thresholds at run time, and reconfigurable VR fabrics to enable more energy efficient

dynamic voltage and frequency scaling (DVFS). Power management systems that rely on run-time load-current sensing to ensure load

current compliance are also reported.

Advances in digital circuits for 3D heterogeneous integration (3DHI): 3DHI is driving the need for solutions offering modular system

construction addressing a diverse range of needs in I/O, power delivery and memory bandwidth. The more complex and challenging

thermal environment resulting from 3DHI is expected to require effective, scalable thermal management solutions to maximize system

performance without compromising reliability. This year’s papers describe compact, DVFS friendly thermal sensors, thermal profiling

measurements, and active interposers targeted toward enabling increasingly modular 3DHI construction.

Widening use of regression and ML to augment circuit capabilities: As circuit design challenges continue to mount with each

technology node, a notable trend reflected in this year’s ISSCC papers is a growing adoption of regression and ML based techniques to

augment circuit techniques to improve area, power and performance while reducing circuit complexity. This year’s papers demonstrate

the use of these techniques for more compact, scalable thermal sensors, and load-current sensors to enable more efficient system-wide

power management.

Memory – 2025 Trends

Subcommittee Chair: Meng-Feng Chang, National Tsing Hua University

The demand for high-speed, high-density, and low-energy memory systems is rapidly growing due to developments in deep-learning (DL)

and machine-learning (ML) artificial intelligence (AI). Furthermore, the role of embedded memories and compute-in-memory (CIM) is

becoming increasingly significant in high-performance systems.

This year, the highest-density 64Gb DDR4, based on STT-MRAM, is introduced. Two 1Tb 3b/cell 3D-NAND-Flash memories are

demonstrated: with the highest density based on 4xx-layer, improvements to read energy efficiency, and a 4.8 and 5.6Gb/s/pin IO speed.

The first 24Gb GDDR7 achieves a 42.5Gb/s/pin IO speed with a dual-emphasis TX. A 38Mb/mm2 dual-rail SRAM in 3nm-FinFET

technology and a 37.9Mb/mm2 SRAM in a 2nm CMOS-nanosheet technology are introduced. A 16nm 216kb microscaling, integer or

floating-point multi-mode gain-cell CIM macro demonstrates a power-efficiency improvement.

Meta

6.4

Myongji University

6.3

Nanjing Electronic Device Institute

33.2

Nanjing Low Power IC Technology Institute

11.4

Nanjing University

9.6

Nanjing University of Science and Technology

34.5

Nano Core Chip Electronic Technology

13.1, 37.6

National Center of Technology Innovation for EDA

14.3, 14.7

National Cheng Kung University

3.1

National Institute for Materials Science

20.1

National Key Laboratory of Solid-State Microwave Devices and Circuits

33.2

National Taiwan University

2.8, 15.1, 17.3, 20.3

National Tsing Hua University

2.5, 2.9, 7.6, 14.1, 14.2, 15.4, 20.2

National University of Defense Technology

8.6

National University of Singapore

17.1

National Yang Ming Chiao Tung University

32.3, 32.6

Nautilus Defense LLC

37.2

New York University Abu Dhabi

28.1, 28.2

now with Kandou Bus SA

34.3

now with Marvell

36.3

Nvidia

17.6

Oregon State University

25.2

Osaka University

20.1

Peking University

6.6, 7.3, 7.7, 10.5, 13.1, 18.2, 18.3, 18.4, 18.5, 18.6,

23.4, 25.5, 28.3, 36.2, 37.5, 37.6

Pohang University of Science and Technology

35.7

Politecnico di Milano

34.2, 34.3

Princeton University

25.3

Purdue University

12.2, 20.11

Quobly

13.5

Realtek Semiconductor

32.3, 32.6

Reconova Technologies

23.4

Rensselaer Polytechnic Institute

33.5

Rice University

26.4, 34.4, 35.2, 35.4

SambaNova Systems

16.4

Samsung Advanced Institute of Technology

37.3

Samsung Electronics

5.4, 6.1, 6.10, 6.3, 6.9, 8.4, 9.8, 10.4, 16.3, 17.4,

21.2, 21.6, 22.4, 30.1, 30.3, 30.4, 35.6

Samsung Research America

9.3

Samsung Semiconductor

37.3

Sandia National labs

25.4

Seoul National University

8.5, 22.1, 22.2, 22.3, 34.1, 37.3, 37.7

Sesame AI

6.4

SGR Semiconductors

5.2

Shanghai Institute of Technical Physics Chinese Academy of Sciences

6.6

Shanghai Jiao Tong University

4.2, 5.2, 20.6

Silan Microelectronics

27.1

SK hynix

22.3, 30.5, 30.6

Sogang University

6.2, 9.5, 15.7, 21.6

SolidVue

6.2

Sony Semiconductor Solutions

6.5

South China University of Technology

4.2, 19.5

Southeast University

8.6, 11.4, 14.3, 14.6, 14.7

Southern University of Science and Technology

9.10, 13.4, 17.1, 34.5, 36.6, 36.8

Stanford University

12.4, 15.2

Sungkyunkwan University

6.10, 6.2

Synopsys

29.4

Taiwan Semiconductor Research Institute

2.5, 32.3, 32.6

Technical University Munich

4.4

Technische Universität Braunschweig

13.3

Teledyne DALSA Semiconductors

4.1

Teledyne e2v

4.1

Texas Instruments

25.3

Tianjin University

10.2, 33.2

Tokyo Institute of Technology

5.6, 11.1

Toronto Western Hospital

15.4

Tsinghua University

2.6, 3.4, 4.3, 5.1, 9.1, 9.10, 9.9, 14.3, 14.4, 17.2,

18.7, 19.9, 20.2, 20.5, 21.8, 21.9, 23.8, 24.1, 24.7,

26.1, 27.2, 28.4

TSMC

4.6, 14.1, 29.1, 29.5, 36.1

TSMC Corporate Research

2.9, 14.1, 14.2

TSMC Design Technology Japan

29.5

TU Delft

27.2, 27.4

Ulsan National Institute of Science and Technology

6.2, 15.7

UM Hetao IC Research Institute

9.9, 21.8, 21.9

University College Dublin

19.4

University College London

35.3

University of Bonn

37.1

University of California

4.2, 5.10, 9.2, 9.3, 12.1, 15.7, 19.6, 20.10, 20.12,

20.3, 20.9, 21.7, 22.1, 25.2, 35.8

University of Chinese Academy of Sciences

6.6, 14.5, 19.8

University of Electronic Science and Technology of China

4.5, 5.3, 5.7, 13.4, 18.1

University of Hawaii

15.2

University of Illinois

36.7

University of Macau

4.5, 9.1, 9.10, 9.9, 14.5, 19.1, 19.7, 20.8, 21.3, 21.8,

21.9, 24.3, 24.4, 24.5, 27.5, 31.4, 32.1

University of Michigan

25.1, 26.3

University of Minnesota-Twin Cities

9.3

University of Notre Dame

15.3

University of Pavia

33.3

University of Pennsylvania

5.2, 15.4

University of Pittsburgh

15.6

University of Science and Technology of China

9.11, 9.4, 32.4, 32.5

University of Southern California

11.2, 20.1, 26.1

University of Texas

5.11

University of Tokyo

13.6, 19.2

University of Toronto

15.4, 15.5

University of Tsukuba

20.1

University of Twente

4.1

University of Ulm

17.5

University of Virginia

37.2

University of Waterloo

11.2, 26.1

University of Zurich and ETH Zurich

13.2

Vango Technologies

27.3

Washington University in St. Louis

25.2

Western Digital

30.2

Wuxi Micro Innovation Integrated Circuit Design

17.2

Xi'an JiaoTong University

7.7, 7.9, 9.7, 36.5

Xidian University

5.9, 7.8, 19.11, 24.2, 27.1

XINYI Information Technology

10.5

XINYI Semi

10.5

XO Semiconductor

6.3

Yonsei University

6.3

Zhejiang University

21.1, 26.5, 27.3

Contact Information

CONTACT INFORMATION

ANALOG

Subcommittee Chair: Viola Schaffer

Texas Instruments

Work Phone: +49-8161-802943

Email: schaffer_viola@ti.com

Press Designates: Qiang Li

Chinwuba Ezekwe

DATA CONVERTERS

Subcommittee Chair: Jan Westra

Broadcom

Work Phone: +31-30-6084517

Email: jan.westra@broadcom.com

Press Designates: Xiyuan Tang

Shahrzad Naraghi

DIGITAL ARCHITECTURES & SYSTEMS

Subcommittee Chair: Rahul Rao

IBM India

Work Phone: +91-80-406-61261

Email: rahulmrao@in.ibm.com

Press Designates: Nathaniel Pinckney

Ji-Hoon Kim

DIGITAL CIRCUITS

Subcommittee Chair: Huichu Liu

Meta

Work Phone: 917-328-5513

Email: huichu@meta.com

Press Designates: Ping-Hsuan Hsieh

Carlos Tokunaga

Yvain Thonnart

IMAGERS, MEDICAL & DISPLAYS

Subcommittee Chair: Rikky Muller

University of California, Berkeley

Work Phone: +1-617-519-8508

Email: rikky@berkeley.edu

Press Designates: Mutsumi Hamaguchi

Carolina Mora Lopez

Augusto Ximenes

MEMORY

Subcommittee Chair: Meng-Fan Chang

National Tsing Hua University

Work Phone: +886-900846406

Email: mfchang@ee.nthu.edu.tw

Press Designates: Saekyu Lee

Hye-Ran Kim

POWER MANAGEMENT

Subcommittee Chair: Bernhard Wicht

University of Hannover

Work Phone: +49-511-762-19690

Email: bernhard.wicht@ims.uni-hannover.de

Press Designates: Hanh-Phuc Le

Mo Huang

Subcommittee Chair: Brian Ginsburg

Texas Instruments

Work Phone: 214-567-6311

Email: bginzz@ti.com

Press Designates: Omeed Momeni

Hiroshi Hamada

Mina Shahmohammadi

SECURITY

Subcommittee Chair: Ingrid Verbauwhede

KU Leuven

Work Phone: +32-16-32-86-25

Email: ingrid.Verbauwhede@esat.kuleuven.be

Press Designates: Rabia Tugce Yazicigil

Utsav Banerjee

TECHNOLOGY DIRECTIONS

Subcommittee Chair: Ali Hajimiri

Caltech

Work Phone: +1-626-487-8463

Email: hajimiri@caltech.edu

Press Designates: Daniel Morris

Jacques "Chris" Rudell

WIRELESS

Subcommittee Chair: Chih-Ming Hung

MediaTek

Work Phone: +1 512-887-2080

Email: hungcm@hotmail.com

Press Designates: Wu-Hsin Chen

Yuanjin Zheng

WIRELINE

Subcommittee Chair: Thomas Toifl

Cisco Systems

Work Phone: +41-76-527-1304

Email: ttoifl@cisco.com

Press Designates: Mozhgan Mansuri

Wei-Zen Chen

Program Chair, ISSCC 2025

Tom Burd

Phone: +1-408-749-2805

Email: tom.burd@amd.com

Program Vice-Chair, ISSCC 2025

Keith Bowman

Phone: +1-919-237-4750

Email: kbowman@qti.qualcomm.com

Press Coordinator

Shahriar Mirabbasi

University of British Columbia

Email: shahriar@ece.ubc.ca

Press-Relations Liaison

Laura Fujino

University of Toronto

Work Phone: 416-418-3034

Email: lcfujino@aol.com

5 views·152 pages

ISSCC 2025 PRESS KIT PDF Free Download

ISSCC 2025 PRESS KIT PDF free Download. Think more deeply and widely.

Uploaded by jgomez on 2/4/2026

/152

100%

2025

PRESS KIT

DRAFT 2-1-2025

ISSCC Press Kit Disclaimer

The material presented here is preliminary.

As of November 5, 2024, there is not enough information to guarantee its correctness.

Thus, it must be used with some caution.

ISSCC 2025

VISION STATEMENT

The International Solid-State Circuits Conference is the foremost global forum for presentation

of advances in solid-state circuits and systems-on-a-chip. The Conference offers a unique

opportunity for engineers working at the cutting edge of IC design and use to maintain

technical currency, and to network with leading experts.

© COPYRIGHT 2025 ISSCC—DO NOT REPRODUCE WITHOUT PERMISSION 
Table of Contents 
Table of Contents ............................................................................................................................................ 4 
Preamble .......................................................................................................................................................... 7 
FAQ on ISSCC ...................................................................................................................................................................................... 7 
Plenary Session (Session 1) ............................................................................................................................................................. 10 
Plenary Session — Invited Papers ............................................................................................................... 12 
Plenary Session — Invited Papers ............................................................................................................... 13 
Evening Events (EE) ...................................................................................................................................... 15 
Bingo Networking Event ................................................................................................................................................................... 15 
EE1: Student Research Preview ....................................................................................................................................................... 15 
EE2: Quantum Computing: Whose Qubit is Better? ....................................................................................................................... 15 
EE3: Future of Analog Design: Still Magical or Mostly Digital? .................................................................................................... 16 
EE4: The Next Decade of AI – Barriers, Opportunities, & Directions ............................................................................................ 16 
Session Overviews and Highlights .............................................................................................................. 17 
Conditions of Publication ............................................................................................................................. 18 
PREAMBLE ......................................................................................................................................................................................... 18 
FOOTNOTE ......................................................................................................................................................................................... 18 
Session 2 Overview: Processors ................................................................................................................. 19 
Session 2 Highlights: Processors ................................................................................................................ 20 
Session 3 Overview: Amplifiers and Analog Front-Ends............................................................................ 22 
Session 3 Highlights: Amplifiers and Analog Front-Ends .......................................................................... 23 
Session 4 Overview: Analog Techniques .................................................................................................... 24 
Session 4 Highlights: Analog Techniques ................................................................................................... 25 
Session 5 Overview:  Front-End Circuits for High-Performance Transceivers ......................................... 26 
Session 5 Highlights:  Front-End Circuits for High-Performance Transceivers ....................................... 27 
Session 6 Overview: Imagers and Displays ................................................................................................ 28 
Session 6 Highlights: Imagers and Displays ............................................................................................... 29 
Session 7 Overview: Ultra-High-Speed Wireline ......................................................................................... 31 
Session 7 Highlights: Ultra-High-Speed Wireline ........................................................................................ 32 
Session 8 Overview: Digital Techniques for System Adaptation, Power Management and Clocking ..... 34 
Session 8 Highlights: Digital Techniques for System Adaptation, Power Management and Clocking ... 35 
Session 9 Overview: Ubiquitous Power Delivery ........................................................................................ 36 
Session 9 Highlights: Ubiquitous Power Delivery ...................................................................................... 37 
Session 10 Overview: Transceiver Chipsets for Communication and Radar ............................................ 38 
Session 10 Highlights: Transceiver Chipsets for Communication and Radar .......................................... 39 
Session 11 Overview:  RF and mm-Wave Wireless Receivers ................................................................... 40 
Session 11 Highlights:  RF and mm-Wave Wireless Receivers ................................................................. 41 
Session 12 Overview:  Innovations from Outside the (ISSCC) Box ........................................................... 42 

© COPYRIGHT 2025 ISSCC—DO NOT REPRODUCE WITHOUT PERMISSION 
Session 12 Highlights: .................................................................................................................................. 42 
Session 13 Overview: Cool Computation Circuits ...................................................................................... 44 
Session 13 Highlights: Cool Computation Circuits..................................................................................... 45 
Session 14 Overview: Compute-In-Memory ................................................................................................ 47 
Session 14 Highlights: Compute-in-Memory ............................................................................................... 48 
Session 15 Overview: Neural Interfaces and Edge Intelligence for Medical Devices ............................... 50 
Session 15 Highlights: Precision neural and cardiac chips ....................................................................... 51 
Session 16 Overview: Highlighted Chip Releases:  Digital and Machine Learning Processors .............. 53 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 54 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 55 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 56 
Session 16 Highlights: Highlighted Chip Releases:  Digital and Machine Learning Processors ............ 57 
Session 17 Overview: Hardware Security .................................................................................................... 58 
Session 17 Highlights: Hardware Security .................................................................................................. 59 
Session 18 Overview:  Noise-Shaping and SAR-Based ADCs ................................................................... 60 
Session 18 Highlights:  Noise-Shaping and SAR-Based ADCs ................................................................. 61 
Session 19 Overview:  Frequency Synthesizers and Series-Resonance VCOs ........................................ 62 
Session 19 Highlights:  Frequency Synthesizers and Series-Resonance VCOs ...................................... 63 
Session 20 Overview:  Sensors and Actuators for Health and Autonomy ................................................ 64 
Session 20 Highlights:  Sensors and Actuators for Health and Autonomy............................................... 65 
Session 21 Overview: Compute & USB Power ............................................................................................ 66 
Session 21 Highlights: Compute & USB Power .......................................................................................... 67 
Session 22 Overview: Memory Interface ...................................................................................................... 68 
Session 22 Highlights: Memory Interface .................................................................................................... 69 
Session 23 Overview: AI Accelerators ......................................................................................................... 70 
Session 23 Highlights: AI Accelerators ....................................................................................................... 71 
Session 24 Overview: High-Frequency ADCs ............................................................................................. 73 
Session 24 Highlights: High-Frequency ADCs ............................................................................................ 74 
Session 25 Overview: High Concepts at High Frequencies ....................................................................... 75 
Session 25 Highlights: High Concepts at High Frequencies ...................................................................... 76 
Session 26 Overview:  Wireless Transmitters and Front-Ends .................................................................. 77 
Session 26 Highlights:  Wireless Transmitters and Front-Ends ................................................................ 78 
Session 27 Overview: Sensor Interfaces ..................................................................................................... 79 
Session 27 Highlights: Scalable Sensor Interface ...................................................................................... 80 
Session 28 Overview: Capacitive Sensor Readout ..................................................................................... 81 
Session 28 Highlights: Capacitive Displacement Sensor ........................................................................... 82 
Session 29 Overview: SRAM ........................................................................................................................ 83 
Session 29 Highlights: SRAM ....................................................................................................................... 84 
Session 30 Overview: Nonvolatile Memory and DRAM .............................................................................. 85 
Session 30 Highlights: Nonvolatile Memory and DRAM ............................................................................. 86 

© COPYRIGHT 2025 ISSCC—DO NOT REPRODUCE WITHOUT PERMISSION 
Session 31 Overview: Energy Harvesting and IoT Power .......................................................................... 88 
Session 31 Highlights: Energy Harvesting and IoT Power ......................................................................... 89 
Session 32 Overview: Isolated Power and Gate Drivers ............................................................................. 90 
Session 32 Highlights: Isolated Power and Gate Drivers ........................................................................... 91 
Session 33 Overview: Components for Beyond 100GHz ............................................................................ 92 
Session 33 Highlights: Components for Beyond 100GHz .......................................................................... 93 
Session 34 Overview:  Digital PLLs and Waveform-Shaping VCOs .......................................................... 94 
Session 34 Highlights:  Digital PLLs and Waveform-Shaping VCOs ......................................................... 95 
Session 35 Overview: Implantable and Wearable Biomedical Devices ..................................................... 96 
Session 35 Highlights: Photoacoustic/Implant RXs.................................................................................... 97 
Session 36 Overview: Ultra-High-Density D2D  and High-Performance Optical Transceivers ................ 98 
Session 36 Highlights: Ultra-High-Density D2D  and High-Performance Optical Transceivers ............... 99 
Session 37 Overview: Design-Technology Optimization and Digital Accelerators ................................ 101 
Session 37 Highlights: Design-Technology Optimization and Digital Accelerators ............................... 101 
Trends .......................................................................................................................................................... 103 
Conditions of Publication ........................................................................................................................... 104 
PREAMBLE ....................................................................................................................................................................................... 104 
FOOTNOTE ....................................................................................................................................................................................... 104 
Analog – 2025 Trends.................................................................................................................................. 106 
Power Management – 2025 Trends ............................................................................................................ 108 
Data Converters – 2025 Trends .................................................................................................................. 110 
RF – 2025 Trends ......................................................................................................................................... 115 
Wireless – 2025 Trends ............................................................................................................................... 118 
Wireline – 2025 Trends ................................................................................................................................ 121 
Digital Architectures & Systems (DAS) – 2025 Trends ............................................................................. 127 
Digital Circuits – 2025 Trends ..................................................................................................................... 131 
Memory – 2025 Trends ................................................................................................................................ 133 
Security – 2025 Trends ................................................................................................................................ 137 
IMD – 2025 Trends (Medical) ....................................................................................................................... 139 
IMD – 2025 Trends (Imagers) ...................................................................................................................... 140 
IMD – 2025 Trends (Displays) ..................................................................................................................... 141 
Technology Directions – 2025 Trends ....................................................................................................... 142 
INDEX ........................................................................................................................................................... 143 
Technical Topics Mapped to Papers .................................................................................................................................................. 144 
Selected Presenting Companies/Institution Mapped to Papers ......................................................................................................... 144 
Contact Information .................................................................................................................................... 149