iSnare.com - Free Content Articles Directory
Authors Contents [Advanced Search][Add OpenSearch][Job Search]
Distribute your articles to thousands of article sites for only $2 and below! Read more...

Index  Computers and Technology
 

Understanding Support Vector Machines (SVMs) Classifiers

 
[ Contact the Author] [ Send to a Friend] [ Article Publisher] [Make PDF] [ Print] [ Bookmark & Share]
 
Read our Terms of Service before reprinting this article. The submitter specified above has claimed the rights to this article.
Danny Wirken

The past couple of years witnessed the increased applications of statistical methods in different fields and for different purposes. These differences made the deficiencies of the existing methods apparent. However, it was not until the Internet became a hit in 1990 that the dissatisfaction with the then current statistical methods considerably grew since the methods are proving to be more and more disadvantageous. This eventually incited the diligent search for a more innovative statistical approach that can be used in classifying large amounts of information.

In the early 1990s, Vladimir Vapnik along with a group of other mathematicians and scientists developed a new statistical approach that is more efficient particularly in dealing with large classification problems. This new approach was called “Support Vector Machines” (SVM).

What are Support Vector Machines you ask? This is a mathematical procedure that makes it possible to teach a computer to classify large amounts of data. The results are said to be more reliable compared to using the old statistical methods. A support vector machine is an approach for building functions from a set of labeled training data.

To fully understand how a support vector machine works, it is imperative to also understand some basic factors first. Classification is normally associated with training and testing data that is made of certain data instances. Each instance in the training set hold one "target value" (class labels) and numerous "attributes" (features). The main objective of a support vector machine is to create a model that calculates target value of data instances in the testing set that are only given to attributes.

A support vector machine has two main functions. The first one is that it can be a classification function (wherein the output is binary: while the input is in a category). Meanwhile, the second function is that it can simply be a general regression function.

With regards to the classification function of support vector machines, it basically works by searching a hyper surface in the space of possible inputs. This hyper surface will then try to split the positive examples from the negative ones. The split will be selected to have the largest distance from the hyper surface to the nearest of the positive and negative examples. Naturally, this would make the classification accurate for testing data that is near, though a slightly different from the training data. There are numerous ways to train support vector machines and the simplest and fastest method is called “Sequential Minimal Optimization.”

The output of a support vector machine is of an irregular value, and not a subsequent prospect of a class given an input. However, there are recently created algorithms that could map support vector machine outputs into posterior probabilities.

Support vector machines classifier are powerful tools, specifically designed to solve large-scale classification problems that are often encountered when classifying text. For instance if you look in a one of the document that belongs to a large group of documents that is actually a related set, if you consider all the words found in the entire set, you will find more words missing from the document compare to the number of words found in the document. This is classification problem is called the sparse data matrix. Classification problems such as large number of documents along with a large number of words and the sparse data matrix, needs a classification engine that can obtain a much faster and more efficient result.

As with everything else in the market, support vector machine classifier can also be obtained from the Internet nowadays. A quick search in the net will provide you with a various system and method that could help you build fast and efficient support vector machine classifiers that are suitable for different problems, particularly ones that are related to large data classification problems such as classifying pages from the Internet as well as other problems related with sparse matrices and large numbers of documents. Though most method may differ in their make up, they have one common factor and that is all of them utilize a technique called the "kernel trick" in order to apply linear classification techniques to non-linear classification problems.

There are some methods that impose upon the least squares nature of such problems, and use the exact line search in its customary process then uses the conjugate gradient method that is suitable to the problem.

However, support vector machines are not without its share of drawbacks. One problem in support vector machine classifier is the lack of computer memory that are needed for support vector machine handling of the data normally caused by text-intensive problems like the ones found in classifying large numbers of text pages found on the Internet.

One solution that has enhanced the ability of computers to learn to classify such data is called “chunking”. Chunking refers to the process wherein the problem is broken down into more convenient pieces that are within the means of the available computer resources. Examples of chunking decomposition techniques used to decrease such problems for support vector machines are the SMO and SVM Light.

However, there is one disadvantage here though. The speed improvement is only moderate, particularly for designing classifiers like the ones needed for web pages that usually contain the largest and most difficult text problems. Keep in mind that speed is imperative. Therefore a support vector machine classifier design that is considerably faster and with a precision that corresponds to the existing classifier engines is needed in order to decrease the training time of support vector machines.

Regardless of the occasional drawbacks, a support vector machine classifier is still a tremendously powerful method of acquiring models for classification. It provides a mechanism for selecting the model structure in a natural approach that offers a low margin for error and risks. Support vector machines classifier has truly become significant tools in today’s modern society. Is it any wonder why mathematicians and scientists alike are still continuously searching for new ways to further improve these new learning machines?

Important NoticeDISCLAIMER: All information, content, and data in this article are sole opinions and/or findings of the individual user or organization that registered and submitted this article at Isnare.com without any fee. The article is strictly for educational or entertainment purposes only and should not be used in any way, implemented or applied without consultation from a professional. We at Isnare.com do not, in anyway, contribute or include our own findings, facts and opinions in any articles presented in this site. Publishing this article does not constitute Isnare.com's support or sponsorship for this article. Isnare.com is an article publishing service. Please read our Terms of Service for more information.

Article Tags: classification [See Dictionary], support [See Dictionary], vector [See Dictionary]
Got a question about this article? Ask the community!
Article published on July 29, 2006 at Isnare.com
 
Rate this article:

Microsoft To Conquer Localized Media Delivery Problems
Submitted by: Danny Wirken

From the time that commercial paid advertisements and other media content came into being, it inadvertently led to an increase in the demand for more highly targeted and effective marketing campaigns on the Internet...

Microsoft To Integrate Rss Support In Windows Operating System
Submitted by: Danny Wirken

Last year Microsoft Corporation shocked the world when they revealed their intention to build RSS (Really Simple Syndication) support in the latest version of the Microsoft Windows operating system, which is under the code-name “Longhorn...

The Latest Patent Applications: Kernel Of Technological Advancement
Submitted by: Danny Wirken

The value of freedom in a country is priceless If one country has freedom of speech and thought then they are sure to have a bright future ahead of them...

Why Wireless DA Is A Multi-Billion Dollar Industry
Submitted by: Danny Wirken

Wireless Directory Assistance (DA) is a virtual directory that offers a fast way to get directory-dependent applications online...

How To Block Direct Image Linking Using .htaccess
Submitted by: Danny Wirken

Most of us have a specified limit to the amount of traffic our web servers will handle for us That limit seems very generous – until you start looking at image downloads and the bandwidth required...

Wordpress Version 2.0.3 Review
Submitted by: Danny Wirken

WordPress, the premier free open-source blogging utility, has gone through several upgrades in its life...

Improving Customer Service Through Help Desk Software
Submitted by: Danny Wirken

Help desk have now become a core part of good business service and operation The term itself is generally associated with the end user support center...

Apple Tiger vs Windows Vista
Submitted by: Danny Wirken

Microsoft’s next-generation operating system is coming in early 2007, offering improvements that are both impressive and unprecedented in the Windows world...

Accessory Computer
Submitted by: Danny Wirken

A home away from home is a great thing, so why not have an office away from the office tooThe spare room or a quiet corner can be a perfect place for productivity...

Plantronics DSP 400 Headset
Submitted by: Danny Wirken

The Plantronics DSP 400 headset produces high quality sound whether you use it with a laptop or a desktop computer...

Diner Dash
Submitted by: Danny Wirken

Diner Dash is all about a young burnt out corporate employee named Flo She gets tired of running the rat race and so opens up her own restaurant...

Avast Antivirus Home Edition
Submitted by: Danny Wirken

Prior to trying the Avast 46 Home Edition, I was very much a Norton user...

Bookworm
Submitted by: Danny Wirken

Bookworm is a very good alternative to some of the violent action games popular today The goal is simple: spell words by linking letters found on the board...

Apple iPod Special U2 Edition
Submitted by: Danny Wirken

New iPod models have sprung up as quickly as mushrooms after the rain and each time they just seem to get better and better...

Zuma Deluxe
Submitted by: Danny Wirken

Zuma is one of those arcade games that starts off really easy and becomes more difficult with each level...

Hightech Cameras Making Sport Training Easier
Submitted by: Jesse Akre

Lately, the advances in commonly used everyday items has increased dramatically We have cell phones that can double as MP3 players, as well as having internet capabilities, video consultations on our computers, digital cameras that can download right to the computer and then be sent in for printing, and so on...

Martin Yale 1217A Autofolder Review
Submitted by: Jeff McRitchie

For years the standard in paper folding machines, the Martin Yale Intimus 1217A is well-known in the small print industry for being a solid and flexible machine...

It’s a Mod Chip World!
Submitted by: Michiel Van Kets

No Nintendo Wii game console seems complete without a mod chip installation and with today’s latest mod chip innovations it’s easier than ever to buy and install your own Wii modification chip...

Martin Yale 400 Paper Jogging Machine Review
Submitted by: Jeff McRitchie

Any business that produces and binds a lot of documents on a regular basis should have a paper jogging machine on hand...

Laminating Film For Beginners
Submitted by: Jeff McRitchie

Roll laminators are awesome machines, but sometimes it can be difficult to know what supplies you need to use with your new laminating system...

PC200 Spiral Coil Binding Machine Review
Submitted by: Jeff McRitchie

The PC200 is positioned as a low-cost spiral coil binding solution for low volume users Here we take a look at this machine and examine its strengths and weaknesses...

Martin Yale 700E Paper Cutter Review
Submitted by: Jeff McRitchie

A commercial-quality paper cutter, the Martin Yale 700E is meant to be used in smaller print shops or in-house production floors for medium to large businesses...

Rhino Tuff CI 3000 Coil Inserter Review
Submitted by: Jeff McRitchie

Rhino's CI 3000 features a unique design that purports to make it easier to do spiral coil book binding...

Lamitek PhotoPro 13 Laminator Review
Submitted by: Jeff McRitchie

There are many laminators available and sometimes it is hard to know which one you should buy It is always a good idea to get a versatile machine, such as one that can do both hot and cold lamination, while also providing a crystal-clear finish...

Lamitek Photosmart 13 Laminator Review
Submitted by: Jeff McRitchie

The emergence and increasing numbers if digital printers has sparked an interest in laminating machines that can work with high-quality photos and/or glossier printed pages...

PC200E Spiral Coil Binding Machine Review
Submitted by: Jeff McRitchie

As the least expensive spiral coil binding machine that offers disengageable dies and an electric coil inserter, the PC200E is well positioned in the marketplace...

Be Careful When Buying Cheap Adobe Software
Submitted by: Adrianna Noton

When individuals are looking to buy software they always love finding cheap Adobe software However are these really great prices too good to be true...

What is the Difference Between Standard and High Yield Toner Cartridges?
Submitted by: Adriana N

There have been improvements in the manufacturing of printer toner cartridges Toner found in a cartridge is dry powder blended with a polymer that sticks on to the paper as printing takes place...

Inverted Microscope: A Great Tool For Studying Living Cells
Submitted by: Edison Rammsey

When you hear the term inverted microscope, you probably think of observing samples from under a microscope...

Digital Microscope: Eight Reasons Why You Must Have it Now!
Submitted by: Edison Rammsey

Welcome the Digital Age through a digital microscope With its eight benefits to be enjoyed, all other microscope will look small in comparison, pun intended...

Isnare.com Footer Divider

© 2004-2009. Isnare Free Articles - An Isnare Online Technologies Free Articles Project. All Rights Reserved.   Privacy Policy