Skip to main content

Local Setup: Small Language Model vs. Quantized Large Model

About

LinkedIn

At present, Terrence Kim holds a dual position at Vanguard as the Head of Fixed Income Innovation of both Investment

Management Fintech Strategy and Vanguard Fintech Ventures, LLC. 


He is at the helm of a diverse team comprising data scientists, machine learning engineers, and

fintech ventures/product management professionals. Their mission is to delve into innovative technologies

with the potential to significantly boost investment returns and optimize capital markets for the benefit of

all investors. Terrence plays a pivotal role in instilling a culture of innovation throughout Vanguard.


In the past, Terrence Kim held the position of Global Business Manager, pioneering the global business

development and product management of multi-asset Trading API products (including EMSX API, IOI API,

RANK AIP, and etc.) at Bloomberg L.P. HE was instrumental in launching real-time data products such as

Custom VWAP, Money Market Service, Real-Time Volatilities, Liquidity Analytics for Portfolio, and

Crypto/Digital Asset service over B-PIPE.


Terrence embarked on his career as a Trader in 1995, specializing in grain and energy commodities.

By 2000, he was spearheading the development and implementation of the FIX protocol and electronic trading

at Wellington Management. His responsibility also included the construction of a next-generation proprietary

multi-asset order management system and reporting system.


Terrence’s experience extends to the sell-side, with roles at WestLB Panmure in London and the launch and

operation of the multi-asset FIXHub business for State Street Global Markets. He also led global Product

Management for ITG Net, the broker-neutral division of ITG Inc. Prior to his tenure at BLoomberg,

Terrence served as Director of Product Management for Complex Event Processor start-up StreamBase Systems,

before its acquisition by TIBCO.

Comments

Popular posts from this blog

2024 Progress...

My team has made considerable advancements in applying various emerging technologies for IMG (Investment Management Group). Predictive Models We have transitioned from conventional methods and refined our approach to using alternative data to more accurately predict the CPI numbers. Our initial approach has not changed by using 2 models (top-down & bottoms-up) for this prediction.   So far we have outperformed both our larger internal team and major banks and dealers in accurately predicting the inflation numbers. Overall roughly 80% accuracy with the last 3 month prediction to be right on the spot.  We have also developed predictive analytics for forecasting prepayment on mortgage-backed securities and predicting macroeconomic regime shifts. Mixed Integer Programming  / Optimization Another area of focus is on numerical optimization to construct a comprehensive portfolio of fixed-income securities for our ETFs and Mutual Funds. This task presents ...

What matters?

 What matters? Six things that matter in LLM in July 2024. 1) Scale of the model, number of parameters: Scale with brute force alone won't work. But the scale does matter depending on the overall goal and the purpose of what the LLM is trying to solve.   2) Compute matters: Even more than ever, we need to look at the infrastructures around LLMs. Infrastructure is also one of the main constraints for the near term and strategically provides an advantage to a few Middle East countries. 3) Data, quality & quantity. It remains true that high-quality data with extensive (longer) training is the way. Quantity of the data also matters. 4) Loss function matters: If your loss function isn't sophisticated or incentivizes the "right" thing, you will have limited improvement. 5) Symmetry or architecture: Do you have the correct architecture around your model(s) and data? Inefficient engineering can be costly to the overall performance and output. There are inherent struc...

Gemma 3 - Quick Summary & Why this matters

Introduction Despite being labeled the laggard in the language model race behind OpenAI and Anthropic, Google holds two decisive advantages in 2025's evolving AI landscape: unparalleled high-quality data reserves and compute infrastructure that dwarfs even Meta's formidable 600,000 H100 GPUs. As pre-training scaling laws plateau, these assets become critical differentiators. This is especially important in 2025 when everyone is looking for the killer application that can legitimize the research on language models. Combined with DeepMind's elite research talent and visionary leadership, Google possesses a power that competitors ignore at their peril. Gemma is a family of open-weight large language models (LLMs) developed by Google DeepMind and other teams at Google, leveraging the research and technology behind the Gemini models. Released starting in February 2024, Gemma aims to provide state-of-the-art performance in lightweight formats, making advanced AI accessible f...