Post on 16-Mar-2018
CS 6364: Perceptrons

Linear Classifiers
§ Inputs are feature values
§ Each feature has a weight
§ Sum is the activation
§ If the activation is:
  § Positive, output +1
  § Negative, output -1
[Figure: a perceptron unit — features f1, f2, f3 weighted by w1, w2, w3 feed a sum Σ, followed by a > 0 test]

activation_w(x) = Σ_i w_i · f_i(x) = w · f(x)
Weights
§ Binary case: compare features to a weight vector
§ Learning: figure out the weight vector from examples
Feature vector f(x1):  # free : 2,  YOUR_NAME : 0,  MISSPELLED : 2,  FROM_FRIEND : 0, ...
Weight vector w:       # free : 4,  YOUR_NAME : -1, MISSPELLED : 1,  FROM_FRIEND : -3, ...
Feature vector f(x2):  # free : 0,  YOUR_NAME : 1,  MISSPELLED : 1,  FROM_FRIEND : 1, ...

A positive dot product w · f(x) means the positive class
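The dot-product test can be sketched directly; the numbers below reproduce the spam example above, and the helper names (`activation`, `classify`) are chosen here for illustration:

```python
# Sketch of linear classification by dot product, using the example
# weights and feature counts from the slide above.

def activation(weights, features):
    """Dot product of the weight vector and the feature vector."""
    return sum(weights[f] * v for f, v in features.items())

def classify(weights, features):
    """Output +1 if the activation is positive, else -1."""
    return 1 if activation(weights, features) > 0 else -1

weights = {"free": 4, "YOUR_NAME": -1, "MISSPELLED": 1, "FROM_FRIEND": -3}

email_a = {"free": 2, "YOUR_NAME": 0, "MISSPELLED": 2, "FROM_FRIEND": 0}
email_b = {"free": 0, "YOUR_NAME": 1, "MISSPELLED": 1, "FROM_FRIEND": 1}

print(classify(weights, email_a))  # activation is 10 -> +1 (SPAM)
print(classify(weights, email_b))  # activation is -3 -> -1 (HAM)
```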
Decision Rules

Binary Decision Rule
§ In the space of feature vectors:
  § Examples are points
  § Any weight vector defines a hyperplane
  § One side corresponds to Y = +1
  § The other corresponds to Y = -1
Example weight vector w:  BIAS : -3,  free : 4,  money : 2, ...

[Figure: decision boundary in the (free, money) feature space; the +1 side is SPAM, the -1 side is HAM]
Weight Updates
Learning: Binary Perceptron
§ Start with weights = 0
§ For each training instance:
  § Classify with current weights
  § If correct (i.e., y = y*), no change!
  § If wrong: adjust the weight vector by adding or subtracting the feature vector. Subtract if y* is -1 (i.e., w ← w + y* · f(x)).
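The loop above can be sketched as follows; the toy data set and the number of passes are invented for illustration:

```python
# Sketch of the binary perceptron: on a mistake, add y_star * f(x)
# to the weights (i.e., add the feature vector if y* = +1, subtract
# it if y* = -1).

def predict(w, f):
    """Classify with current weights: sign of the activation."""
    return 1 if sum(w[i] * f[i] for i in range(len(w))) > 0 else -1

def train_binary_perceptron(data, n_features, passes=10):
    w = [0.0] * n_features          # start with weights = 0
    for _ in range(passes):
        for f, y_star in data:      # for each training instance
            y = predict(w, f)       # classify with current weights
            if y != y_star:         # if wrong: adjust the weights
                for i in range(n_features):
                    w[i] += y_star * f[i]
    return w

# Toy separable data: label is +1 iff the second feature dominates.
data = [([1, 0], -1), ([0, 1], 1), ([2, 1], -1), ([1, 3], 1)]
w = train_binary_perceptron(data, n_features=2)
print(all(predict(w, f) == y for f, y in data))  # True: data is separable
```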
Examples: Perceptron
§ Separable Case
Multiclass Decision Rule
§ If we have multiple classes:
  § A weight vector for each class: w_y
  § Score (activation) of a class y: w_y · f(x)
  § Prediction: the highest score wins: y = argmax_y w_y · f(x)

Binary = multiclass where the negative class has weight zero
Learning:MulticlassPerceptron
§ Startwithallweights=0§ Pickuptrainingexamplesonebyone§ Predictwithcurrentweights
§ Ifcorrect,nochange!§ Ifwrong:lowerscoreofwronganswer,
raisescoreofrightanswer
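A sketch of these steps; the class labels, feature vectors, and pass count below are invented for illustration:

```python
# Sketch of the multiclass perceptron: one weight vector per class;
# on a mistake, lower the wrong class's weights and raise the right
# class's weights by the feature vector.

def score(w_y, f):
    """Score (activation) of a class: dot product w_y . f(x)."""
    return sum(w_y[i] * f[i] for i in range(len(f)))

def predict(weights, f):
    """Prediction: the highest-scoring class wins."""
    return max(weights, key=lambda y: score(weights[y], f))

def train_multiclass_perceptron(data, classes, n_features, passes=10):
    weights = {y: [0.0] * n_features for y in classes}  # all weights = 0
    for _ in range(passes):
        for f, y_star in data:
            y = predict(weights, f)
            if y != y_star:  # wrong: lower wrong answer, raise right one
                for i in range(n_features):
                    weights[y][i] -= f[i]
                    weights[y_star][i] += f[i]
    return weights

# Toy data: one-hot features, one class per feature.
classes = ["a", "b", "c"]
data = [([1, 0, 0], "a"), ([0, 1, 0], "b"), ([0, 0, 1], "c")]
weights = train_multiclass_perceptron(data, classes, n_features=3)
print(all(predict(weights, f) == y for f, y in data))  # True
```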
Example: Multiclass Perceptron

Initial weight vectors (one per class):
  w1:  BIAS : 1,  win : 0,  game : 0,  vote : 0,  the : 0, ...
  w2:  BIAS : 0,  win : 0,  game : 0,  vote : 0,  the : 0, ...
  w3:  BIAS : 0,  win : 0,  game : 0,  vote : 0,  the : 0, ...

Training examples: “win the vote”, “win the election”, “win the game”
Properties of Perceptrons
§ Separability: true if some parameters get the training set perfectly correct
§ Convergence: if the training data are separable, the perceptron will eventually converge (binary case)
§ Mistake Bound: the maximum number of mistakes (binary case) is related to the margin, or degree of separability
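The mistake bound can be stated precisely (this is Novikoff's classic perceptron bound, not spelled out on the slide; it assumes every feature vector is bounded in norm by R and the data are separable with margin γ):

```latex
% Novikoff's bound: if \|f(x)\| \le R for all examples and some
% separator achieves margin \gamma > 0, the binary perceptron makes
% at most this many mistakes before converging:
\text{mistakes} \;\le\; \frac{R^2}{\gamma^2}
```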
[Figures: a separable data set vs. a non-separable data set]
Examples: Perceptron
§ Non-Separable Case
Improving the Perceptron
Problems with the Perceptron
§ Noise: if the data isn't separable, the weights might thrash
  § Averaging weight vectors over time can help (averaged perceptron)
§ Mediocre generalization: finds a “barely” separating solution
§ Overtraining: test / held-out accuracy usually rises, then falls
  § Overtraining is a kind of overfitting
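The averaged perceptron mentioned above can be sketched as follows; the toy data and pass count are invented, and the running-sum implementation is one simple way to do the averaging:

```python
# Sketch of the averaged perceptron: train as usual, but accumulate
# the weight vector after every example and return the average, which
# smooths out thrashing on noisy data.

def predict(w, f):
    return 1 if sum(wi * fi for wi, fi in zip(w, f)) > 0 else -1

def train_averaged_perceptron(data, n_features, passes=10):
    w = [0.0] * n_features
    total = [0.0] * n_features      # running sum of weight vectors
    count = 0
    for _ in range(passes):
        for f, y_star in data:
            if predict(w, f) != y_star:
                for i in range(n_features):
                    w[i] += y_star * f[i]
            for i in range(n_features):
                total[i] += w[i]    # accumulate after every example
            count += 1
    return [t / count for t in total]  # averaged weights

data = [([1, 0], -1), ([0, 1], 1), ([2, 1], -1), ([1, 3], 1)]
w_avg = train_averaged_perceptron(data, n_features=2)
print(all(predict(w_avg, f) == y for f, y in data))  # True
```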
Fixing the Perceptron
§ Idea: adjust the weight update to mitigate these effects
§ MIRA*: choose an update size τ that fixes the current mistake…
§ …but minimizes the change to w:

  min_w ½‖w − w′‖²  s.t.  w_{y*} · f(x) ≥ w_y · f(x) + 1

§ The +1 helps to generalize

* Margin Infused Relaxed Algorithm
Minimum Correcting Update

The minimizing τ is not 0 (otherwise no error would have been made), so the minimum is where the constraint holds with equality:

  τ = ((w_y − w_{y*}) · f(x) + 1) / (2 f(x) · f(x))
Maximum Step Size
§ In practice, it's also bad to make updates that are too large
  § The example may be labeled incorrectly
  § You may not have enough features
§ Solution: cap the maximum possible value of τ with some constant C
§ Corresponds to an optimization that assumes non-separable data
§ Usually converges faster than the perceptron
§ Usually better, especially on noisy data
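A sketch of the MIRA step, using the closed-form τ derived above with the cap C; the example classes, features, and value of C are invented:

```python
# Sketch of one MIRA update: the smallest tau that makes the updated
# weights separate the example with margin 1, capped at constant C.
# Closed form: tau = ((w_y - w_{y*}) . f + 1) / (2 f . f).

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def mira_update(weights, f, y_star, y, C=0.5):
    """Lower the wrong class y, raise the right class y_star by tau*f."""
    tau = (dot(weights[y], f) - dot(weights[y_star], f) + 1) / (2 * dot(f, f))
    tau = min(tau, C)  # cap the maximum possible step size with C
    weights[y] = [wi - tau * fi for wi, fi in zip(weights[y], f)]
    weights[y_star] = [wi + tau * fi for wi, fi in zip(weights[y_star], f)]
    return tau

# Mistake on f = [1, 1]: predicted "b", true class "a".
weights = {"a": [0.0, 0.0], "b": [0.0, 0.0]}
tau = mira_update(weights, [1, 1], y_star="a", y="b")
print(tau)  # 0.25: the margin constraint now holds with equality
```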
Linear Separators
§ Which of these linear separators is optimal?
Support Vector Machines
§ Maximizing the margin: good according to intuition, theory, and practice
§ Only the support vectors matter; other training examples are ignorable
§ Support vector machines (SVMs) find the separator with the maximum margin
§ Basically, SVMs are MIRA where you optimize over all examples at once
MIRA (one example at a time):  min_w ½‖w − w′‖²  s.t.  w_{y*} · f(x) ≥ w_y · f(x) + 1

SVM (all examples at once):  min_w ½‖w‖²  s.t.  ∀ i, y:  w_{y*_i} · f(x_i) ≥ w_y · f(x_i) + 1
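One way to see "optimize over all examples at once" in code is subgradient descent on the hinge loss, a standard (if simplified) way to train a linear SVM; the data, learning rate, regularization constant, and epoch count here are all invented:

```python
# Sketch of a linear SVM trained by subgradient descent on the
# regularized hinge loss: every epoch looks at ALL examples, not one.

def train_linear_svm(data, n_features, lam=0.01, lr=0.1, epochs=200):
    w = [0.0] * n_features
    for _ in range(epochs):
        # subgradient of (lam/2)||w||^2 + average hinge loss
        grad = [lam * wi for wi in w]
        for f, y in data:
            margin = y * sum(wi * fi for wi, fi in zip(w, f))
            if margin < 1:  # example violates the margin constraint
                for i in range(n_features):
                    grad[i] -= y * f[i] / len(data)
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w

data = [([1, 0], -1), ([0, 1], 1), ([2, 1], -1), ([1, 3], 1)]
w = train_linear_svm(data, n_features=2)
correct = all(
    (sum(wi * fi for wi, fi in zip(w, f)) > 0) == (y > 0) for f, y in data
)
print(correct)  # True on this separable toy set
```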
Classification: Comparison
§ Naïve Bayes:
  § Builds a model of the training data
  § Gives prediction probabilities
  § Makes strong assumptions about feature independence
  § One pass through the data (counting)
§ Perceptrons / MIRA:
  § Make fewer assumptions about the data
  § Mistake-driven learning
  § Multiple passes through the data (prediction)
  § Often more accurate