Inference with Gemma using Dataflow and vLLM
vLLM's continuous batching and Dataflow's model manager optimizes LLM serving and simplifies the deployment process, delivering a powerful combination for developers to build high-performance LLM inference pipelines more efficiently.
JUG Milano 2024
JUG Milano 2024 On Monday, I visited JUG Milano together with my colleague Rosaria Rossini. She did a short talk about soft and hard skills needed for Open Source titled Java & Skills: a research activities in research@eclipse. After that, I did a talk...
Docker tutorial: Get started with Docker volumes
Docker containers are meant to be immutable, meaning the code and data they hold never change. Immutability is useful when you want to be sure the code running in production is the same as the code that passed QA testing; it’s not so useful when you...
Kotlin for Java developers
After Java, Kotlin is the most popular JVM language. Kotlin is an expressive, concise language with strong functional programming support. It’s especially appealing to Java developers because it is fully interoperable with Java, and the syntax is a...
The Agile Manifesto was ahead of its time
Last week I went on a short rant about scrum, and how it isn’t agile and is, well, dumb. It occurred to me that one of the obstacles to truly being agile has been the limits of software technology. The way we had to build and deliver software didn...
Visual Studio 17.12 brings C++, Copilot enhancements
Visual Studio 2022 17.12, the latest version of Microsoft’s signature integrated development environment (IDE), is now generally available, with improvements ranging from productivity enhancements for C++ developers to integrated AI variable inspect...
Go language rises in Tiobe popularity index
The Google-built Go language has reached an all time high position in the Tiobe index of programming language popularity, ranking seventh in the November 2024 edition of the index. While Go already had cracked Tiobe’s top 10, the language had never...
Red Hat Developer Hub adds AI templates
Red Hat Developer Hub, an enterprise-grade internal developer portal for platform engineering teams, is adding enhancements for AI, with five templates for building AI-powered applications for common use cases. Rolled out on November 12, the AI-focuse...
Why your AI models stumble before the finish line
In 2023, enterprises across industries invested heavily in generative AI proof of concepts (POCs), eager to explore the technology’s potential. Fast-forward to 2024, companies face a new challenge: moving AI initiatives from prototype to production....
Is your software architecture as clean as your code?
Modern software must function smoothly within a diverse ecosystem, from on-premises monoliths to ever-evolving cloud-based microservices. Architectural choices made during software development, be they explicit or implicit, add complexity and create i...
Can Wasm replace containers?
Heralded as the fourth standard of the web, WebAssembly, or Wasm for short, has sparked heated discussion since its origins. Wasm is an assembly-like programming language, a compact binary format, and a compilation target for C, C++, C#, Go, JavaScrip...