Tag Archives: opensource

Waffles – command-line tools for machine learning and data mining

Waffles seeks to be the world’s most comprehensive collection of command-line tools for machine learning and data mining. Our native tools have minimal dependencies (no interpreter, VM, or runtime environment is necessary), and build cross-platform. If you have a useful data mining tool that meets these criteria, we want it in Waffles.

(Full Story: Waffles – command-line tools for machine learning and data mining)

Black Duck – Open Source, Abundance and Open Innovation

With open source comes abundance — more than 500,000+ projects are freely available today, and that number’s growing rapidly. Clearly the confluence between developers, communities and businesses turning to open source has provided a way for enterprises to grow and speed development, even with constrained resources.

(Full Story: Black Duck – Open Source, Abundance and Open Innovation)

Evaluating Text Extraction Algorithms

According to my evaluation setup and personal experience, the best open source solution currently available on the market is the boilerpipe library. If we treat precision with an equal amount of importance as recall (reflected in the F1 score) and take into account the performance consistency across both domains, then boilerpipe performs best. Performance aside, its codebase is seems to be quite stable and it works really fast.

(Full Story: Evaluating Text Extraction Algorithms)

OPEN – a tool for teachers to create and share lessons with their studenrs

OPEN, a free tool used by teachers to create digital lessons for their own students. Instead of learning content in the classroom, students will take lessons created by their own teacher at home and spend the class period working on homework-style problems.

(Full Story: OPEN – a tool for teachers to create and share lessons with their studenrs)

How Linux mastered Wall Street | ITworld

The largest exchange, the New York Stock Exchange (NYSE) Euronext, is run on a Linux system that can generate 1,500,000 quotes and process 250,000 orders every second, offering acknowledgments of each transaction within two milliseconds.

Red Hat Enterprise Linux is now the dominant Linux distribution among exchanges, Lameter said. Red Hat counts among its customers the Chicago Mercantile Exchange, New York Mercantile Exchange, Frankfurt Stock Exchange, Eurex derivative exchange and Philippine Stock Exchange.

(Full Story: How Linux mastered Wall Street | ITworld)

Introducing Apache Mahout – calable, commercial-friendly machine learning for building intelligent applications

Summary:  Once the exclusive domain of academics and corporations with large research budgets, intelligent applications that learn from data and user input are becoming more common. The need for machine-learning techniques like clustering, collaborative filtering, and categorization has never been greater, be it for finding commonalities among large groups of people or automatically tagging large volumes of Web content. The Apache Mahout project aims to make building intelligent applications easier and faster. Mahout co-founder Grant Ingersoll introduces the basic concepts of machine learning and then demonstrates how to use Mahout to cluster documents, make recommendations, and organize content.

(Full Story: Introducing Apache Mahout – calable, commercial-friendly machine learning for building intelligent applications)

OPEN – a tool for teachers to create and share lessons with their studenrs

OPEN, a free tool used by teachers to create digital lessons for their own students. Instead of learning content in the classroom, students will take lessons created by their own teacher at home and spend the class period working on homework-style problems.

(Full Story: OPEN – a tool for teachers to create and share lessons with their studenrs)

How Linux mastered Wall Street | ITworld

The largest exchange, the New York Stock Exchange (NYSE) Euronext, is run on a Linux system that can generate 1,500,000 quotes and process 250,000 orders every second, offering acknowledgments of each transaction within two milliseconds.

Red Hat Enterprise Linux is now the dominant Linux distribution among exchanges, Lameter said. Red Hat counts among its customers the Chicago Mercantile Exchange, New York Mercantile Exchange, Frankfurt Stock Exchange, Eurex derivative exchange and Philippine Stock Exchange.

(Full Story: How Linux mastered Wall Street | ITworld)

google-refine – Google Refine, a power tool for working with messy data (formerly Freebase Gridworks) – Google Project Hosting

Google Refine is a power tool for working with messy data, cleaning it up, transforming it from one format into another, extending it with web services, and linking it to databases like Freebase.

(Full Story: google-refine – Google Refine, a power tool for working with messy data (formerly Freebase Gridworks) – Google Project Hosting)

Open – A tool for teachers to create and share lessons with their students.

Open, a free tool used by teachers to create digital lessons for their own students. Instead of learning content in the classroom, students will take lessons created by their own teacher at home and spend the class period working on homework-style problems

(Full Story: Open – A tool for teachers to create and share lessons with their students.)

Follow

Get every new post delivered to your Inbox.