FAQFAQ   SearchSearch   MemberlistMemberlist   UsergroupsUsergroups   RegisterRegister 
 ProfileProfile   PreferencesPreferences   Log in to check your private messagesLog in to check your private messages   Log inLog in 
Forum index » Science and Technology » Math » Research
Averaging gradient descent method
Post new topic   Reply to topic Page 1 of 1 [5 Posts] View previous topic :: View next topic
Author Message
Sébastien Gadat
science forum beginner


Joined: 07 Mar 2005
Posts: 6

PostPosted: Fri Apr 14, 2006 1:00 pm    Post subject: Averaging gradient descent method Reply with quote

Hi,

I am looking for informations about what happens to the dynamical system
(dimension n)

X_t = - 1/t \int_{0}^t gradient E(X_s) ds

It is something like a gradient descent averaged from time 0 to time t.
E is a smooth energy functionnal and X_t is the position of the point at
time t.
--
Sebastien
Back to top
Hans Engler
science forum beginner


Joined: 24 Mar 2005
Posts: 5

PostPosted: Sun Apr 16, 2006 6:40 pm    Post subject: Re: Averaging gradient descent method Reply with quote

If you want continuous trajectories for t >= 0, then X_0 must satisfy X_0 +
E(X_0) = 0. And then X_t = X_0 is a solution for all t. So the dynamics are
rather trivial.


"Sébastien Gadat" <sebastien.gadat@free.fr> wrote in message
news:e1o6cm$9nc$1@news.ks.uiuc.edu...
Quote:
Hi,

I am looking for informations about what happens to the dynamical system
(dimension n)

X_t = - 1/t \int_{0}^t gradient E(X_s) ds

It is something like a gradient descent averaged from time 0 to time t.
E is a smooth energy functionnal and X_t is the position of the point at
time t.
--
Sebastien

Back to top
Sébastien Gadat
science forum beginner


Joined: 07 Mar 2005
Posts: 6

PostPosted: Sun Apr 16, 2006 10:45 pm    Post subject: Re: Averaging gradient descent method Reply with quote

In fact, there was a mistake in the former formula. The good evolution
equation is:
X'_t = - 1/t \int_{0}^t gradient E(X_s) ds
starting from X_0 with X'_0=- Gradient E(X_0).
I do not find the conclusion so obvious with this equation.
--
Sebastien

"Hans Engler" <h.engler@verizon.net> a écrit dans le message de news:
e1u32f$fg$1@dizzy.math.ohio-state.edu...
Quote:
If you want continuous trajectories for t >= 0, then X_0 must satisfy X_0
+
E(X_0) = 0. And then X_t = X_0 is a solution for all t. So the dynamics
are
rather trivial.


"Sébastien Gadat" <sebastien.gadat@free.fr> wrote in message
news:e1o6cm$9nc$1@news.ks.uiuc.edu...
Hi,

I am looking for informations about what happens to the dynamical system
(dimension n)

X_t = - 1/t \int_{0}^t gradient E(X_s) ds

It is something like a gradient descent averaged from time 0 to time t.
E is a smooth energy functionnal and X_t is the position of the point at
time t.
--
Sebastien


Back to top
Hans Engler
science forum beginner


Joined: 24 Mar 2005
Posts: 5

PostPosted: Mon Apr 17, 2006 1:34 am    Post subject: Re: Averaging gradient descent method Reply with quote

That's less trivial indeed :)

Let's look at the simplest case: X_t \in R, E(x) = x. The integral equation
is equivalent to tX'' + X' + X = 0. With initial data X_0 = 1 = -X'_0, the
power series for the solution turns out to be
X_t = 0_F_1(;1,-t) = 1 - t + t^2/(2!)^2 - t^3/(3!)^2 + t^4/(4!)^2 ... , a
confluent hypergeometric limit function (see
http://mathworld.wolfram.com/ConfluentHypergeometricLimitFunction.html )
which can also be expressed in terms of a Bessel function of the first kind
as
X_t = J_0(2 \sqrt{t}) .

The asymptotics are known:

X_t ~ \pi^{-1/2}t^{-1/4} \cos(2 t^{1/2} - \pi/4).

Thus a very slow oscillatory decay to zero.

It's now easy to see what happens in the general linear case, X_t \in R^n
and E(x) = Ax + b with A symmetric and positive definite: Just transform to
principal components and deduce that |X_t - X*| ~ t^{-1/4}, where AX* + b =
0.

For general E (gradient of a multi-well coercive potential) the situation is
presumably much more complicated than for the "standard" case X_t = -E(X_t),
since the equation exhibits a "long term memory". It's most likely still
true that solutions with initial data near local minima of the potential
converge to the minimum. I'm not aware of much work on the general behavior
of such integral equations.

Hope this helps :)


"Sébastien Gadat" <sebastien.gadat@free.fr> wrote in message
news:e1uhdd$143$1@dizzy.math.ohio-state.edu...
Quote:

In fact, there was a mistake in the former formula. The good evolution
equation is:
X'_t = - 1/t \int_{0}^t gradient E(X_s) ds
starting from X_0 with X'_0=- Gradient E(X_0).
I do not find the conclusion so obvious with this equation.
--
Sebastien
Back to top
Sébastien Gadat
science forum beginner


Joined: 07 Mar 2005
Posts: 6

PostPosted: Mon Apr 17, 2006 12:45 pm    Post subject: Re: Averaging gradient descent method Reply with quote

Hi,

This is exactly what I found, in the quadratic case, we can see that the
solutions are based on Bessel functions, and if we use linearisation
approximation, we can equally seen that equilibrium are stable with rate of
convergence in t^{-1/4}. Now my main problem is to justify that the
trajectory attains a neighbourhood of a minimum of E.
By the way, in most of my experiments in Matlab, I have seen this type of
oscillating convergence to the minimum. I am consequently optimist for the
nature of the result, but I really don't know how could I achieve it!

--
Sebastien

"Hans Engler" <h.engler@verizon.net> a écrit dans le message de news:
e1ura8$1bv$1@dizzy.math.ohio-state.edu...
Quote:
That's less trivial indeed :)

Let's look at the simplest case: X_t \in R, E(x) = x. The integral
equation
is equivalent to tX'' + X' + X = 0. With initial data X_0 = 1 = -X'_0,
the
power series for the solution turns out to be
X_t = 0_F_1(;1,-t) = 1 - t + t^2/(2!)^2 - t^3/(3!)^2 + t^4/(4!)^2 ... , a
confluent hypergeometric limit function (see
http://mathworld.wolfram.com/ConfluentHypergeometricLimitFunction.html )
which can also be expressed in terms of a Bessel function of the first
kind
as
X_t = J_0(2 \sqrt{t}) .

The asymptotics are known:

X_t ~ \pi^{-1/2}t^{-1/4} \cos(2 t^{1/2} - \pi/4).

Thus a very slow oscillatory decay to zero.

It's now easy to see what happens in the general linear case, X_t \in R^n
and E(x) = Ax + b with A symmetric and positive definite: Just transform
to
principal components and deduce that |X_t - X*| ~ t^{-1/4}, where AX* + b
=
0.

For general E (gradient of a multi-well coercive potential) the situation
is
presumably much more complicated than for the "standard" case X_t
= -E(X_t),
since the equation exhibits a "long term memory". It's most likely still
true that solutions with initial data near local minima of the potential
converge to the minimum. I'm not aware of much work on the general
behavior
of such integral equations.

Hope this helps Smile
Back to top
Google

Back to top
Display posts from previous:   
Post new topic   Reply to topic Page 1 of 1 [5 Posts] View previous topic :: View next topic
The time now is Wed Jan 07, 2009 8:46 pm | All times are GMT
Forum index » Science and Technology » Math » Research
Jump to:  

Similar Topics
Topic Author Forum Replies Last Post
No new posts Help in identifying a numerical method Don num-analysis 2 Thu Jul 20, 2006 8:56 pm
No new posts descent for projectivity David Madore Research 3 Thu Jul 20, 2006 12:04 pm
No new posts troubles in determination of specific... eos Chem 0 Thu Jul 20, 2006 10:05 am
No new posts troubles in determination of specific... eos Chem 0 Thu Jul 20, 2006 10:02 am
No new posts possible to use Generalized Method of... comtech Math 1 Thu Jul 20, 2006 12:49 am

Bad Credit Mortgages | Guitar Lessons | Credit Counseling | Credit Cards UK | Fish Tank Help
Copyright © 2004-2005 DeniX Solutions SRL
Other DeniX Solutions sites: Electronics forum |  Medicine forum |  Unix/Linux blog |  Unix/Linux documentation |  Unix/Linux forums


Powered by phpBB © 2001, 2005 phpBB Group
[ Time: 0.3826s ][ Queries: 16 (0.1354s) ][ GZIP on - Debug on ]