Monthly Archives: December 2024

Why Small Language Models matter

I just read the following article: https://medium.com/towards-data-science/your-company-needs-small-language-models-d0a223e0b6d9. I personally liked one of its illustrations, and from a non-technical view I was shocked by another. At the very least, this is why SLMs (small language models) will be the next step.

Posted in Uncategorized | Leave a comment

The Seq2Seq issue: history repeats for transformers

Classical sequence-to-sequence (Seq2Seq) models based on Recurrent Neural Networks (RNNs) condense the entire input sequence into a single fixed-length context vector. While effective for short sequences, they struggle with longer or more complex inputs due to their inability to capture … Continue reading
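To make the fixed-length bottleneck concrete, here is a minimal sketch of an RNN encoder in plain Python. The sizes, the random weights, and the helper names (`make_weights`, `rnn_step`, `encode`) are assumptions made up for illustration and are not taken from the post. It only shows that a 2-step input and a 200-step input are squeezed into a context vector of the same size.

```python
import math
import random

HIDDEN_SIZE = 4  # hypothetical context vector size
INPUT_SIZE = 3   # hypothetical feature size per time step


def make_weights(rows, cols, rng):
    """Create a rows x cols matrix of small random weights."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def rnn_step(hidden, x, w_hh, w_xh):
    """One vanilla RNN update: h_new = tanh(W_hh * h + W_xh * x)."""
    new_hidden = []
    for i in range(len(hidden)):
        total = sum(w_hh[i][j] * hidden[j] for j in range(len(hidden)))
        total += sum(w_xh[i][k] * x[k] for k in range(len(x)))
        new_hidden.append(math.tanh(total))
    return new_hidden


def encode(sequence, w_hh, w_xh):
    """Run the encoder over the sequence; the last hidden state is the context."""
    hidden = [0.0] * len(w_hh)
    for x in sequence:
        hidden = rnn_step(hidden, x, w_hh, w_xh)
    return hidden


rng = random.Random(0)
w_hh = make_weights(HIDDEN_SIZE, HIDDEN_SIZE, rng)
w_xh = make_weights(HIDDEN_SIZE, INPUT_SIZE, rng)

short_seq = [[rng.random() for _ in range(INPUT_SIZE)] for _ in range(2)]
long_seq = [[rng.random() for _ in range(INPUT_SIZE)] for _ in range(200)]

short_context = encode(short_seq, w_hh, w_xh)
long_context = encode(long_seq, w_hh, w_xh)

# Both contexts have HIDDEN_SIZE entries, no matter how long the input was.
print(len(short_context), len(long_context))
```

Whatever the decoder needs to know about all 200 steps has to fit into those few numbers, which is the limitation the excerpt describes.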

DORA’s automotive quick assessment

The DORA Quick Check is a streamlined assessment tool designed to measure your team’s software delivery performance. It focuses on four key metrics: deployment frequency, lead time for changes, change failure rate, and time to restore service. Explore more about it here: DORA Quick Check. A first attempt for typical automotive cases:

Different perspectives on n-tier architecture

A common pattern in solution architecture is to split a system into layers (aka tiers), e.g., frontend, backend, and database. Common approaches such as micro-services and self-contained systems (SCS) are just different variations of these commonly used layers.

Thoughts on automotive micro-service architectures

Micro-service architecture is a contemporary software design approach widely adopted by companies such as Netflix and Amazon, allowing them to release features at an impressive pace. This approach decomposes a large application into a collection of small, loosely coupled services. … Continue reading

Takeaways from the book “Software Engineering at Google: Lessons Learned from Programming Over Time”

My takeaways from the book “Software Engineering at Google: Lessons Learned from Programming Over Time” can be grouped into cultural, organizational, and technical insights. Open communication and psychological safety are emphasized as critical components of a successful engineering culture. These principles foster … Continue reading

Technology Management principles

As a technology manager, navigating new solution spaces to improve technology adoption, mitigate risks, and discover opportunities can be a complex task. The TRIZ (Theory of Inventive Problem Solving) methodology provides a structured approach to evaluate existing technological solutions from … Continue reading

Why KAN networks

Traditional neural network design relies heavily on gradient flow, which refers to the movement of gradients during the training process. A gradient is a value (vector or scalar) that represents the direction and magnitude of the steepest increase of a … Continue reading
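As a small worked example of what a gradient is, the sketch below estimates it numerically for a toy function. The function `f(x, y) = x^2 + 3y`, the step size, and the helper `numerical_gradient` are assumptions chosen for illustration and do not come from the post. Analytically the gradient is `(2x, 3)`, so at the point `(1, 2)` it should be close to `(2, 3)`.

```python
def f(point):
    """Toy function f(x, y) = x^2 + 3y (hypothetical example)."""
    x, y = point
    return x * x + 3.0 * y


def numerical_gradient(func, point, h=1e-5):
    """Estimate the gradient with central differences, one coordinate at a time."""
    grad = []
    for i in range(len(point)):
        forward = list(point)
        backward = list(point)
        forward[i] += h
        backward[i] -= h
        grad.append((func(forward) - func(backward)) / (2.0 * h))
    return grad


point = [1.0, 2.0]
grad = numerical_gradient(f, point)

# Stepping against the gradient (gradient descent) lowers the function value.
learning_rate = 0.1
next_point = [p - learning_rate * g for p, g in zip(point, grad)]

print(grad)
print(f(point), f(next_point))
```

The gradient points in the direction of steepest increase, which is why training moves the parameters in the opposite direction.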

Model size reduction via activity pruning

Gradient flow describes the movement of gradients during the training of a machine learning model. A gradient is a value (vector or scalar) that shows the direction and magnitude of the steepest increase of a function, indicating how a function … Continue reading

Multi-GPU training

Different approaches to achieving parallelism when training deep neural networks: Any better ideas?
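The list of approaches is not preserved in this excerpt, so the following is only a sketch of one widely used option, data parallelism, and not necessarily what the post covers. It simulates two "workers" in plain Python (no real GPUs), using a hypothetical one-parameter model `y = w * x` with a mean squared error loss. Each worker computes the gradient on its own shard of the batch, and the averaged result matches the full-batch gradient because the shards have equal size.

```python
def mse_gradient(w, xs, ys):
    """Gradient of mean((w * x - y)^2) with respect to w."""
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def split_into_shards(items, num_workers):
    """Split a batch into equally sized, contiguous shards (one per worker)."""
    shard_size = len(items) // num_workers
    return [items[i * shard_size:(i + 1) * shard_size] for i in range(num_workers)]


# Hypothetical toy batch following y = 2x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
w = 0.5
num_workers = 2  # stand-in for the number of GPUs

x_shards = split_into_shards(xs, num_workers)
y_shards = split_into_shards(ys, num_workers)

# Each worker computes a local gradient on its shard.
local_grads = [mse_gradient(w, x_shard, y_shard)
               for x_shard, y_shard in zip(x_shards, y_shards)]

# The "all-reduce" step: average the local gradients across workers.
averaged_grad = sum(local_grads) / num_workers

full_batch_grad = mse_gradient(w, xs, ys)
print(averaged_grad, full_batch_grad)
```

In a real setup the averaging step is done by the framework's communication layer. Other strategies, such as splitting the model itself across devices, trade communication cost against memory differently.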
