cond-mat0311545/p61.tex
1: %\documentclass[twocolumn,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}
2: %\documentclass[twocolumn,showpacs]{revtex4}
3: %\documentclass[preprint,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}
4: %\documentclass[prl,twocolumn,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}
5: 
6: % Some other (several out of many) possibilities
7: %\documentclass[preprint,aps]{revtex4}
8: %\documentclass[preprint,aps,draft]{revtex4}
9: %\documentclass[prb]{revtex4}% Physical Review B
10:  \documentclass[prb,twocolumn,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}
11: 
12: 
13: 
14: %================================================================================
15: \usepackage[dvips]{graphicx}% Include figure files
16: \usepackage{dcolumn}% Align table columns on decimal point
17: \usepackage{bm}% bold math
18: 
19: %%
20: %% Float stuff
21: %%
22:  \setlength{\floatsep}{4pt plus 4pt minus 4pt}
23: %%  \setlength{\textfloatsep}{7pt plus 4pt minus 4pt}
24: %%   \setlength{\intextsep}{0pt plus 2pt minus 2pt}
25:  \setcounter{totalnumber}{4}
26:  \setcounter{topnumber}{4}
27:  \setcounter{bottomnumber}{4}
28:  \renewcommand{\textfraction}{0.0} 
29:  \renewcommand{\topfraction}{1.0}
30:  \renewcommand{\bottomfraction}{1.0}
31: %%
32: %% hub.latex
33: %%
34: 
35: %\nofiles
36:   \addtolength{\voffset}{1.3cm} %shke: for letter paper
37: %\linespread{1.00}
38: 
39: \begin{document}
40: 
41: %\preprint{APS/123-QED}
42: 
43: \title{  
44: Electron Transport Through Molecules: \\
45: Self-consistent and Non-self-consistent Approaches 
46: }
47: 
48: \author{ San-Huang Ke,$^{1,2}$ Harold U. Baranger,$^{2}$ and Weitao Yang$^{1}$}
49: 
50: \affiliation{
51: $^{\rm 1}$Department of Chemistry, Duke University, Durham, NC 27708-0354 \\
52: $^{\rm 2}$Department of Physics, Duke University, Durham, NC 27708-0305
53: }
54: 
55: \date{\today}% It is always \today, today,
56:              %  but any date may be explicitly specified
57: 
58: \begin{abstract}
59: 
60: A self-consistent method for calculating electron transport through a molecular 
61: device is developed. It is based on density functional theory electronic structure 
62: calculations under periodic boundary conditions and implemented in the framework 
63: of the non-equilibrium Green function approach. To avoid the substantial computational 
64: cost in finding the $I$-$V$ characteristic of large systems, we also develop an approximate 
65: but much more efficient non-self-consistent method. Here  the change in effective potential 
66: in the device region caused by a bias is approximated by the main features of the voltage drop.  
67: As applications, the $I$-$V$ curves of a carbon chain and an aluminum chain sandwiched between 
68: two aluminum electrodes are calculated -- two systems in which the voltage drops very differently.
69: By comparing to the self-consistent results, we show that this non-self-consistent approach 
70: works well and can give quantitatively good results.         
71: 
72: \end{abstract}
73: 
74: \pacs{73.40.Cg, 72.10.-d, 85.65.+h}
75: \maketitle
76: 
77: %===================================================================
78: \section{Introduction}
79: 
80: In recent years, electron transport through molecules sandwiched between metallic electrodes 
81: has been attracting increasing attention both for fundamental reasons and because it may form 
82: the basis of a future molecular electronics technology.\cite{mol1,mol2,mol3,mol4,mol5}  
83: Experimentally, it is difficult to precisely manipulate or even measure the atomic structure 
84: of the molecule-electrode contacts. Therefore, neither the influence of atomic structure on 
85: transport through the devices nor a path to improved performance is clear.  As a result, 
86: the ability to calculate the atomic and electronic structure as well as the transport 
87: properties of electrode-molecule-electrode systems is important and useful in this field.  
88: 
89: Electron transport through nanoscale molecular devices differs significantly from that through 
90: macroscopic semiconductor heterostructures. In the latter, the effective-mass approximation 
91: is generally successful because of the periodic lattice structure and large electron wavelength.  
92: In contrast, in a molecular device a carrier electron will be scattered by only a few atoms whose 
93: particular arrangement, then, matters a great deal. Consequently, the effective-mass approximation
94: breaks down, and the electronic structure of the molecular device must be taken into account 
95: explicitly. For this purpose, methods based on density functional theory (DFT) are sufficiently 
96: accurate and efficient.\cite{dft1,dft2}  Conventional DFT methods, however, deal with either 
97: closed molecular systems (in quantum chemistry) or periodic solids (in solid state physics), 
98: neither of which is applicable to molecular transport. Thus one needs to develop a DFT approach 
99: suitable for a system which is open, infinite, non-periodic, and non-equilibrium (if the bias 
100: voltage is nonzero).
101: 
102: One way to do this was suggested by Lang, {\it et al.} \cite{lang1,lang2,lang3,lang4} 
103: By using the jellium model for the two metallic electrodes of an electrode-molecule-electrode 
104: system, they  mapped the Kohn-Sham equation of the system into the Lippmann-Schwinger scattering 
105: equation and solved for the scattering states self-consistently.  They then calculated the 
106: current by summing up the contributions from all the scattering states, following a 
107: Landauer-type approach.\cite{landauer1}  In this way, both the conductance and $I$-$V$ 
108: characteristics of the system can be investigated.  The use of the jellium model for electrodes 
109: is convenient and simple but limited: it cannot include the effects of different contact 
110: geometries and surface relaxation, for instance.  It also cannot deal with directional bonding 
111: such as in semiconductors and transition metals. As a result, the molecule-electrode charge 
112: transfer, which is one of the key factors affecting transport, may not be quantitatively 
113: correct.\cite{test}
114: 
115: Another way to develop the desired DFT approach is to use the non-equilibrium Green function 
116: (NEGF) method.\cite{negf1,negf2} The required open and non-equilibrium conditions can be 
117: treated rigorously, at least in a formal sense. This method is also closely related to the 
118: Landaur approach\cite{landauer1} and has proven to be powerful for studying electron transport 
119: through nanoscale devices.  Therefore, by combining the NEGF method with conventional DFT-based 
120: electronic structure methods used in quantum chemistry or solid-state physics, the coherent 
121: transport properties of an electrode-molecule-electrode system can be determined fully 
122: self-consistently from first-principles.  A further advantage of the NEGF+DFT combination is 
123: that the atomic structure of the device region and the metallic electrodes are treated 
124: explicitly on the same footing.  As has been mentioned, the molecule-electrode interaction 
125: will induce charge transfer between them and atomic relaxation of their contact -- both have 
126: a significant effect on electron transport. As a result, the division of the system into the 
127: molecule and the electrodes is not meaningful anymore, and some parts of the electrodes must 
128: be included into the device region to form an ``extended molecule".
129: 
130: Based on this combined NEGF+DFT method, there are several successful implementations 
131: \cite{datta1, datta2, transiesta, mcdcal,mono} for molecular conduction and extensive
132: theoretical results in the recent literature.
133: \cite{datta1, datta2, transiesta, mcdcal, xue, transiesta1, mcdcal1,mono} 
134: According to the way of treating the extended molecule, the semi-infinite leads, 
135: and their couplings in a lead-molecule-lead (LML) system,
136: these implementations can be roughly divided into two categories.
137: 
138: In one category people adopted a cluster geometry
139: for all the subsystems of a LML system 
140: or for the extended molecule with the leads
141: treated by a tight binding approach (for example, Ref. \onlinecite{datta1,xue,non-sc}).
142: It is then convenient to
143: employ well-established quantum chemistry code (like Gaussian or DMol) to do
144: the electronic structure calculation for the subsystem(s).
145: However, there are potential problems with these treatments for strong
146: molecule-lead couplings:
147: in this case it is obviously necessary to included large parts of the
148: leads into the extended molecule so that the strong molecule-lead interaction
149: can be fully accommodated. To eliminate the artificially introduced surface
150: effects an even larger system is needed, which is usually difficult
151: to deal with by a quantum chemistry code. So in practice only several (or
152: even only one) lead atoms are attached to the molecule to form an extended
153: molecule (for example, Ref. \onlinecite{xue,non-sc}).
154: In this case, significant artificial surface effects are inevitable, the contact
155: atomic relaxation cannot be included, and an accurate molecule-lead
156: coupling is not available. In addition, there may be artificial scattering at the interface
157: between the tight-binding part of the lead and the DFT part of the lead
158: (included in the extended molecule). 
159: 
160: In the other category (Ref.\onlinecite{transiesta,mcdcal}), 
161: people adopted periodic boundary conditions (PBC)
162: (as in solid state physics) with large parts of the leads included
163: in the extended molecule, so that the interaction between the molecule and its
164: images will be screened off by the metallic lead in between. In this case
165: all the potential problems mentioned above will be absent and the 
166: whole LML system becomes nearly perfect in geometry and all the subsystems are 
167: treated exactly on the same footing.
168: Two examples of successful implementations adopting the PBC are 
169: the TranSiesta package \cite{transiesta} and the MCDCAL package.\cite{mcdcal}
170: On the other hand, a drawback is introduced by PBC: when a bias is applied,
171: the Hartree potential must jump unphysically between unit cells. This has previously been
172: addressed by having an independent solution of the Poisson equation. 
173: 
174: In this paper, we first develop a fully self-consistent NEGF+DFT method with
175: PBC, which has small but important differences from the two previous implementations. 
176: The advantage of our 
177: method is that it is simple  while still rigorous: the non-equilibrium condition under a bias 
178: is fully included in the NEGF part, and, as a result, we do not need to make changes in the 
179: conventional electronic structure part. So it is straightforward to combine with any electronic 
180: structure method that uses a localized basis set.  More importantly, in this way the problem 
181: of the unphysical jumps in the Hartree potential is avoided.
182: 
183: A shortcoming of the full self-consistent (SC) NEGF+DFT approach is the large computational 
184: effort involved, especially for large systems, large bias voltages, or cases where many bias 
185: voltages need to be calculated as for $I$-$V$ characteristics.  As a result, a 
186: non-self-consistent (non-SC) method with much higher efficiency and useful accuracy 
187: is highly desirable.
188: As a step toward this goal, we also construct 
189: an approximate but much more efficient non-SC method in which the change in self-consistent 
190: effective potential in the device region caused by a bias is approximated by the main features 
191: of the voltage drop.
192: 
193: As an application of our approach, in this paper we do calculations by combining it with a very efficient electronic structure package SIESTA.\cite{siesta}  The $I$-$V$ curves of two systems with different typical voltage drop behaviors -- a carbon or aluminum chain sandwiched between two aluminum electrodes -- are calculated. Our self-consistent results are in good agreement with those from previous calculations.\cite{transiesta,mcdcal_chain}
194: By comparison to the self-consistent results, we examine the validity of the non-SC approach, showing that this approach works quite well and can give quantitatively nearly correct answers.
195: 
196: The arrangement of this paper is as follows. In Section II we give briefly a description of our implementation of the NEGF+DFT method.  Because the basic formalism of the NEGF+DFT is well established,\cite{datta1, datta2, transiesta, mcdcal} we show only those formula useful for introducing the new features of our method.  The present SC and non-SC approaches are explained in Section III.  Section IV starts with results for a carbon chain and an aluminum chain sandwiched between two Al(001) electrodes.  Our results are compared with previous results, and we discuss the validity of the non-SC approach by comparison to the self-consistent results. In Section V we summarize and conclude.
197: 
198: 
199: \section{NEGF+DFT method and its implementation}
200: 
201: \subsection{Modeling of real physical systems}
202: 
203: %------------------------------ Fig.1 ---------------------------------
204: \begin{figure}[b] 
205: \includegraphics[angle=270,width= 8.5cm]{str.eps}
206: \label{fig_str}
207: \caption{Schematic drawing of a system containing a molecule sandwiched between two metallic electrodes (leads $L$ and $R$).  The region $C$ is formed by including some parts of $L$ and $R$ so that the $C_C$ part (extended molecule) is charge neutral and the $C_L$ and $C_R$ parts have bulk properties. Because of the use of a localized basis set, the leads $L$ and $R$ can be divided into principal layers (denoted by numbers 0, 1, 2, ...). $C_{L'}$, $C_{C'}$, and $C_{R'}$ denote the parts used in the present non-SC approach (see Section II E). Their interface is called $X$ in the text.  $\mathbf{h}^{0}$'s and $\mathbf{h}^{1}$'s are the Hamiltonian matrices within and between the principal layers, respectively. }
208: \end{figure}
209: 
210: Experimentally, a molecular device system consists of at least a molecular junction coupled with two metallic electrodes (leads $L$ and $R$) under a bias $V_b$ (two-terminal system). In some cases, there is also a gate terminal applying a gate voltage on the whole system (three-terminal system). Here we consider only the two-terminal system which is schematically shown in Fig. 1. An important consideration for modeling the real physical system is the charge transfer and atomic relaxation around the two molecule-lead contact regions.  As a result, we have to include some parts of the metallic leads into the device region, forming an extended molecule. One obvious convergence criteria for the size of the extended molecule is its charge neutrality. Then the charge transfer and the potential disturbance caused by the molecule can be considered screened off outside the extended molecule region.  In order to obtain good convergence, we actually include a large part of each metallic lead into the $C$ region, so that the layers adjacent to $L$ and $R$ (i.e., $C_L$ and $C_R$ parts in Fig. 1) have bulk properties. The total Hamiltonian of the system is:
211: \begin{equation}
212: \hat{H} = \hat{H}_{LL} + \hat{H}_{CC} + \hat{H}_{RR} + \hat{H}_{LC} + \hat{H}_{CR}.
213: \end{equation}
214: Note that, here the leads $L$ and $R$ interact only through the molecular junction, so their direct interaction term $\hat{H}_{LR}$ vanishes (this can always be satisfied by using a localized basis set).
215: 
216: 
217: \subsection{Localized basis set}
218: When $\hat{H}$ is expanded in a basis set, generally only the matrix of $\hat{H}_{CC}$ (denoted $\mathbf{H}_{CC}$) is finite.  However, consider a localized (but not necessarily orthogonal) basis set, by which we mean that the overlap between any two basis functions, $\phi_{\mu}({\mathbf{r}-\mathbf{R}_1})$ and $\phi_{\nu}({\mathbf{r}-\mathbf{R}_2})$, will be zero if they are separated far enough from each other: $\mathbf{S}_{\mu \nu} \equiv \langle \mu|\nu \rangle =0$ if $|{\mathbf{R}_1-\mathbf{R}_2}| >$ certain cutoff distance. In this case, the region $C$ interacts directly only with finite parts of $L$ and $R$, and the non-zero part of the matrices $\mathbf{H}_{LC}$ and $\mathbf{H}_{CR}$ also become finite.  Furthermore, we can divide the leads $L$ and $R$ into principal layers so that any principal layer interacts only with its two nearest neighbors (see Fig. 1).  As a result, the matrices $\mathbf{H}_{LL}$ and $\mathbf{H}_{RR}$ have the following block tridiagonal form:
219: \begin{eqnarray} \label{equ_HLL}
220: (\mathbf{H}_{LL})_{ij}&=&
221: \left\{ \begin{array}{ll}
222:  \mathbf{h}_{LL}^{0}, & \textrm{if } i-j =0\\*[0.04in]
223:  \mathbf{h}_{LL}^{1}, & \textrm{if } i-j =1\\*[0.04in]
224: (\mathbf{h}_{LL}^{1})^{\dagger}, & \textrm{if } j-i =1\\*[0.04in]
225:  0, & \textrm{if } |i-j|>1, 
226: \end{array} \right. 
227: %\\
228: %(\mathbf{H}_{RR})_{ij}&=&
229: %\left\{ \begin{array}{ll}
230: % \mathbf{h}_{RR}^{0}, & \textrm{if } i-j =0\\
231: % \mathbf{h}_{RR}^{1}, & \textrm{if } j-i =1\\
232: %(\mathbf{h}_{LL}^{1})^{\dagger}, & \textrm{if } i-j =1\\
233: % 0, & \textrm{if } |i-j|>1, 
234: %\end{array} \right.
235: \end{eqnarray}
236: where $\mathbf{h}^{0}_{LL}$ and $\mathbf{h}^{1}_{LL}$ are the Hamiltonian matrices within and between the principal layers, respectively, and $i$, $j$ are principal layer indexes as shown in Fig. 1.  Because the $C_L$ and $C_R$ parts which interact directly with $L$ and $R$, have bulk properties, the non-zero part of $\mathbf{H}_{LC}$ ($\mathbf{H}_{CR}$) is just $\mathbf{h}_{LL}^1$ ($\mathbf{h}_{RR}^1$), as shown in Fig. 1.
237: 
238: In the localized basis and after the partition shown in Fig. 1, the matrix Green function $\mathbf{G}$ of the whole system, defined by
239: %\begin{equation}
240: $(E \mathbf{S} - \mathbf{H}) \mathbf{G}(E) = \mathbf{I}$,
241: %\end{equation}
242: satisfies
243: \begin{eqnarray}
244: \left[ \begin{array}{ccc}
245: E\mathbf{S}_{LL}-\mathbf{H}_{LL} & E\mathbf{S}_{LC}-\mathbf{H}_{LC} & 0 \\[0.04in]
246: E\mathbf{S}_{LC}^{\dagger}-\mathbf{H}_{LC}^{\dagger} & E\mathbf{S}_{CC}-\mathbf{H}_{CC} & 
247:                                                        E\mathbf{S}_{CR}-\mathbf{H}_{CR} \\[0.04in]
248: 0 & E\mathbf{S}_{CR}^{\dagger}-\mathbf{H}_{CR}^{\dagger} & E\mathbf{S}_{RR}-\mathbf{H}_{RR}
249: \end{array} \right] \\ \nonumber 
250: \times 
251: \left[ \begin{array}{ccc}
252: \mathbf{G}_{LL} & \mathbf{G}_{LC} & \mathbf{G}_{LR} \\
253: \mathbf{G}_{CL} & \mathbf{G}_{CC} & \mathbf{G}_{CR} \\
254: \mathbf{G}_{RL} & \mathbf{G}_{RC} & \mathbf{G}_{RR}
255: \end{array} \right]
256: =
257: \left[ \begin{array}{ccc}
258: \mathbf{I}_{LL} & 0 & 0 \\
259: 0 & \mathbf{I}_{CC} & 0 \\
260: 0 & 0 & \mathbf{I}_{RR}
261: \end{array} \right].
262: \end{eqnarray}
263: The most important part of $\mathbf{G}$ is $\mathbf{G}_{CC}$, corresponding to the region $C$; from the above equation,
264: \begin{equation}
265: \label{equ_Gc}
266: \mathbf{G}_{CC}(E)=\left\{E\mathbf{S}_{CC}-\left[\mathbf{H}_{CC}+
267:                 \mathbf{\Sigma}_L(E)+\mathbf{\Sigma}_R(E)\right]\right\}^{-1},
268: \end{equation}
269: where $\mathbf{\Sigma}_L(E)$ and $\mathbf{\Sigma}_R(E)$ are self-energies which incorporate the effect of the two semi-infinite leads $L$ and $R$, respectively.
270: $\mathbf{\Sigma}_L(E)$, for example, is defined by
271: \begin{eqnarray}
272: \mathbf{\Sigma}_L(E)\!=\!\left( E\mathbf{S}_{LC}-\mathbf{H}_{LC}\right)^{\dagger}
273:                      \mathbf{G}_{LL}^0(E)\left( E\mathbf{S}_{LC}-\mathbf{H}_{LC}\right)
274: %\\
275: %\mathbf{\Sigma}_R(E)\!=\!\left( E\mathbf{S}_{CR}-\mathbf{H}_{CR}\right)
276: %                     \mathbf{G}_{RR}^0(E)\left( E\mathbf{S}_{CR}-\mathbf{H}_{CR}\right)^{\dagger},
277: \end{eqnarray}
278: where $\mathbf{G}_{LL}^0$ 
279: %and $\mathbf{G}_{RR}^0$ 
280: is the retarded Green function of the left semi-infinite lead. The latter is given in turn by
281: \begin{eqnarray}
282: \mathbf{G}_{LL}^0(E)&=&\left(z\mathbf{S}_{LL}-\mathbf{H}_{LL}\right)^{-1},
283: \\
284: %\mathbf{G}_{RR}^0(E)&=&\left(z\mathbf{S}_{RR}-\mathbf{H}_{RR}\right)^{-1},\\
285: z&=&E+i\eta, \nonumber
286: \end{eqnarray}
287: where a typical value for the lifetime broadening $\eta$ is about 1~meV.  
288: 
289: Because of the localized basis set, the non-zero part of $\mathbf{S}_{LC}$, $\mathbf{H}_{LC}$, $\mathbf{S}_{CR}$, and $\mathbf{H}_{CR}$ become finite (being $\mathbf{s}_{LL}^1$, $\mathbf{h}_{LL}^1$, $\mathbf{s}_{RR}^1$, and $\mathbf{h}_{RR}^1$).  As a result, only the part of $\mathbf{G}_{LL}^0$ and $\mathbf{G}_{RR}^0$ corresponding to the $0^{th}$ principal layer of the two leads (denoted $\mathbf{g}_{LL}^0$ and $\mathbf{g}_{RR}^0$) are needed for calculating the non-zero part of the self-energies: 
290: \begin{eqnarray}
291: \mathbf{\Sigma}_L(E)\!=\!\left( E\mathbf{s}_{LL}^1-\mathbf{h}_{LL}^1\right)^{\dagger}
292:                     \mathbf{g}_{LL}^0(E)\left( E\mathbf{s}_{LL}^1-\mathbf{h}_{LL}^1\right). 
293: %\\
294: %\mathbf{\Sigma}_R(E)\!=\!\left( E\mathbf{s}^1_{RR}-\mathbf{h}_{RR}^1\right)
295: %        \mathbf{g}_{RR}^0(E)\left( E\mathbf{s}_{RR}^1-\mathbf{h}_{RR}^1\right)^{\dagger}.
296: \end{eqnarray}
297: Our notation here follows that for the Hamiltonian: $\mathbf{s}$ and $\mathbf{g}$ are submatrices of the corresponding upper case matrices.  $\mathbf{g}_{LL}^0$ and $\mathbf{g}_{RR}^0$ are simply the surface Green functions of the two semi-infinite leads.  $\mathbf{g}_{LL}^0$, for example, can be calculated either by simple block recursion,
298: %\begin{widetext}
299: \begin{eqnarray}
300: \mathbf{g}_{LL}^0(E)&=&\big[ z\mathbf{s}_{LL}^0 - \mathbf{h}_{LL}^0 \\ 
301:  &-& \left( z\mathbf{s}_{LL}^1-\mathbf{h}_{LL}^1\right)^{\dagger} 
302:  \mathbf{g}_{LL}^0(E)\left( z\mathbf{s}_{LL}^1-\mathbf{h}_{LL}^1\right) \big]^{-1}, \nonumber
303: %\\
304: %\mathbf{g}_{RR}^0(E)&=&\left[ z\mathbf{s}_{RR}^0 - \mathbf{h}_{RR}^0 
305: % - \left( z\mathbf{s}_{RR}^1-\mathbf{h}_{RR}^1\right) 
306: % \mathbf{g}_{RR}^0(E)\left( z\mathbf{s}_{RR}^1-\mathbf{h}_{RR}^1\right)^{\dagger} \right]^{-1},
307: \end{eqnarray}
308: %\end{widetext}
309: or by a renormalization method \cite{renorm} in terms of 
310: $\mathbf{s}_{LL}^0$, $\mathbf{s}_{LL}^1$, $\mathbf{h}_{LL}^0$, and $\mathbf{h}_{LL}^1$
311: which can be determined by separate DFT calculations for the two leads.  For small lifetime broadening $\eta$ (1~meV), we find that the renormalization method is much faster than simple block recursion. This is natural since $n$ renormalization interations incorporate $2^n$ principal layers, while $n$ recursions incorporate only $n$. 
312: 
313:  From $\mathbf{G}_{CC}(E)$, the projected density of states (PDOS) on the molecule (indicated by $m$) is given by
314: \begin{equation}
315: N_{m}(E)=-\frac{1}{\pi} \textrm{Im} \left\{ \textrm{Tr}_{m} 
316: \left[\mathbf{G}_{CC}(E+i\eta)\cdot \mathbf{S}_{CC}\right]\right\},
317: \end{equation}
318: where $\textrm{Tr}_{m}$ means the trace is performed only on the molecular part of the matrix.
319: 
320: 
321: %\subsection{NEGF+DFT techniques}
322: \subsection{Current}
323: 
324: The Non-Equilibrium Green Function technique (NEGF)\cite{negf2,negf_i, negf2, negf_i2} provides a convenient way to calculate the current by post-processing a DFT calculation. The result is quite natural and intuitive: First, the basic assumption is that there is no energy relaxation within the molecular region. Then, following a Landauer-like point of view,\cite{landauer1,negf2} one divides the electrons in the molecule into two sets using scattering-wave states, those that came from the left lead and those that came from the right. The left-lead states are, of course, filled up to the chemical potential in the left lead, $\mu_L$, while the right-lead states are filled up to $\mu_R$. In equilibrium, the two chemical potentials are equal, and the current carried by the left-lead states is, of course, equal to that carried by the right-lead states. As a bias is applied, the balance between the two types of states is disrupted and current flows. As different states are populated because of the change in chemical potentials, the charge density in the molecule also changes. The potential profile must be solved for self-consistently in order to get an accurate measure of the transmission. It is this self-consistency which is the time-consuming part of the calculation.
325: 
326: The expression for the steady-state current through the $C$ region for applied bias $V_b$ is
327:    \begin{equation} \label{equ_I}
328: I(V_b)= - \frac{2e^2}{h}\int^{+\infty}_{-\infty}\!\!\!\!T(E,V_b)
329:          \left[f(E\!-\!\mu_L)-f(E\!-\!\mu_R)\right]dE,
330:    \end{equation}
331: where $\mu_L$ and $\mu_R$ are the chemical potentials, $f$ is the Fermi function, and $T(E,V_b)$ is the transmission probability for electrons from the left lead to right lead with energy $E$ under bias $V_b$. The transmission probability is related to Green functions by
332:    \begin{equation} \label{equ_t}
333:  T(E,V_b)=\textrm{Tr}\left[\mathbf{\Gamma}_L(E)\mathbf{G}_{CC}(E)
334:          \mathbf{\Gamma}_R(E)\mathbf{G}_{CC}^{\dagger}(E)\right],
335:    \end{equation}
336: where
337:    \begin{eqnarray} 
338:  \mathbf{\Gamma}_{L,R}(E) &=&i\left(\mathbf{\Sigma}_{L,R}(E)                        
339:       -\left[\mathbf{\Sigma}_{L,R}(E)\right]^{\dagger}\right)
340:    \end{eqnarray} 
341: reflect the coupling at energy $E$ between the $C$ region and the leads $L$ and $R$, respectively.
342: 
343: The charge density corresponding to the above picture of left-lead states filled to $\mu_L$ and right-lead states filled $\mu_R$ can also be expressed in terms of Green functions. In particular, the density matrix of region $C$ in the basis-function space is
344: \begin{widetext}
345: \begin{eqnarray}
346: \mathbf{D}_{CC}&=&\frac{1}{2\pi}\int_{-\infty}^{+\infty}dE\left[ \label{equ_d1} \label{equ_d2} 
347:          \mathbf{G}_{CC}(E)\mathbf{\Gamma}_L(E)\mathbf{G}_{CC}^{\dagger}(E)f(E-\mu_L) 
348:        + \mathbf{G}_{CC}(E)\mathbf{\Gamma}_R(E)\mathbf{G}_{CC}^{\dagger}(E)f(E-\mu_R)\right] \\ 
349:       &=&-\frac{1}{\pi}\int_{-\infty}^{+\infty}dE\textrm{Im} \label{equ_d3}
350:            \left[\mathbf{G}_{CC}(E)f(E-\mu_L)\right] \\ \nonumber   
351:      &&+ \frac{1}{2\pi}\int_{-\infty}^{+\infty}dE\left[
352:            \mathbf{G}_{CC}(E)\mathbf{\Gamma}_R(E)\mathbf{G}_{CC}^{\dagger}(E)\right] 
353:        \times  \left[f(E-\mu_R)-f(E-\mu_L) \right].       
354: \end{eqnarray} 
355: \end{widetext}
356: Time-reversal symmetry ($\mathbf{G}_{CC}^{\dagger}=\mathbf{G}_{CC}^{*}$) was invoked in going from (\ref{equ_d2}) to (\ref{equ_d3}).  The integrand of the first term of (\ref{equ_d3}) is analytic (all poles of $\mathbf{G}_{CC}(E)$ are on real axis), so the integral can be evaluated easily by complex contour integration.  However, the integrand of the second term is not analytic, so it must be evaluated by integrating very close to the real axis using a very fine energy mesh. The whole integration path\cite{datta2} is shown in Fig. 2. Because we construct the region $C$ such that $C_L$ and $C_R$ have essentially bulk properties, we can use the bulk density matrix for them.
357: 
358: %------------------------------------ Fig.2 ----------------------------
359: \begin{figure}[b] \label{fig_path}
360: \includegraphics[angle=0,width= 5.0cm]{path.eps}
361: \caption{Schematic drawing of the integration path in the complex energy plane used to calculate the density matrix [Eq. (\ref{equ_d3})].  $E_B$ is the lowest energy of occupied states, and $\mu_{L,R}$ are the chemical potentials of the left and right leads, respectively ($\mu_{L}$ $<$ $\mu_{R}$ is assumed).  Note that for energy window [$E_B$,$\mu_{L}$] a complex contour integration is performed while for energy window [$\mu_{L}$,$\mu_{R}$] a direct energy integration is performed by using a fine energy mesh and a very small imaginary part.} 
362: \end{figure}
363: 
364: The calculated density matrix is then output to the DFT part to calculate the electron density $\rho$ and to construct a new $\mathbf{H}_{CC}$:
365: \begin{equation}
366: \rho(\mathbf{r})=\sum_{\mu,\nu}\phi^{*}_{\mu}(\mathbf{r})\textrm{Re}
367:                   \left[\left(\mathbf{D}_{CC}\right)_{\mu\nu}\right]\phi_{\nu}(\mathbf{r}),
368: \end{equation}
369: \vspace*{-5mm}
370: \begin{equation}
371: (\mathbf{H}_{CC})_{\mu\nu}=\left\langle\mu\left|\hat{T}+\hat{V}_{\textrm{ext}}(\mathbf{r})
372:                  +\hat{V}_{\textrm{H}}[\rho(\mathbf{r})]
373:                  +\hat{V}_{\textrm{xc}}[\rho(\mathbf{r})]\right|\nu\right\rangle, \label{equ_H}
374: \end{equation}
375: where $\hat{T}$ is the kinetic energy, and $\hat{V}_{\textrm{ext}}$, $\hat{V}_{\textrm{H}}$, and $\hat{V}_{\textrm{xc}}$ are the external, Hartree, and exchange-correlation potential energies, respectively.  The new $\mathbf{H}_{CC}$ replaces the old, a new $\mathbf{D}_{CC}$ is calculated, and so on until $\mathbf{H}_{CC}$ or $\mathbf{D}_{CC}$ converges.  Finally, the transmission function $T(E)$ can be calculated by Eq. (\ref{equ_t}).
376: 
377: One subtlety here is the different boundary conditions used in the Green function and DFT parts 
378: -- open versus periodic, respectively. 
379: This means that some iteration must be done even at $V_b = 0$. 
380: If the supercell of the DFT part has the same size as region $C$ in Fig. 1, 
381: then the $C_L$ and $C_R$ parts will interact directly due to the periodic boundary condition. 
382: However, this interaction is absent in the calculation of the density matrix. 
383: The same problem exists also for $\mathbf{H}_{CC}$.  
384: So we need to do some translation work between the Green function and DFT parts: 
385: to add this interaction when we go from NEGF to DFT by using the density matrix elements 
386: between two adjacent principal layers, and to remove it when we go from DFT to NEGF 
387: by setting corresponding parts of $\mathbf{S}_{CC}$ and $\mathbf{H}_{CC}$ to zero. 
388: Generally, the supercell of the DFT part can be made larger than the size of the region $C$,
389: especially for systems without a translational symmetry, because the DFT part is usually 
390: much cheaper than the Green function part.
391: 
392: \section{New self-consistent and non-self-consistent approaches}
393: 
394: \subsection{Bias voltage}
395: 
396: For non-zero $V_b$, care must be taken to account for the effects of the bias voltage on the charge density. One way to proceed is to apply a constant field in the direction parallel to the leads within the supercell of the DFT calculation. Thus a linear drop is added to the external potential in Eq. (\ref{equ_H}), and the effect on the charge density follows from, for instance, solving the Poisson equation. This approach is not straightforward for periodic boundary conditions because of the artificial potential jumps at the two supercell boundaries in the lead direction.  One way to eliminate the unphysical jumps is to use a larger supercell for the DFT calculation and replace the Hamiltonian of each of the regions near the potential jumps by the bulk one with a constant potential shift given by the bias voltage; this is implemented in the Transiesta program.\cite{transiesta}
397: 
398: Here we propose a different approach to handle the bias, one which is less obvious but turns out to be simpler in the end: The bias is included through the density matrix ($\mathbf{D}_{CC}$) in the Green function calculation instead of the potential ($\mathbf{H}_{CC}$) in the DFT part.  Specifically, we calculate the density matrix by Eq. (\ref{equ_d1}) under the boundary condition that there is a potential difference $V_b$ between part $C_L$ (together with the left lead) and part $C_R$ (together with the right lead).  This is done by shifting all the potentials related to the left (right) lead and the $C_L$ ($C_R$) part by $-V_b/2$ ($+V_b/2$).  Shifting the potential in a lead is equivalent to directly shifting the energy by the opposite amount, so
399: \begin{widetext}
400: \begin{equation}
401: \mathbf{D}_{CC} = \frac{1}{2\pi}\int_{-\infty}^{+\infty}dE\left[ \label{equ_d4}
402:             \mathbf{G}_{CC}(E)\mathbf{\Gamma}_L(E+\frac{eV_b}{2})
403:             \mathbf{G}_{CC}^{\dagger}(E)f(E-\mu_L) 
404:            + \mathbf{G}_{CC}(E)\mathbf{\Gamma}_R(E-\frac{eV_b}{2})
405:             \mathbf{G}_{CC}^{\dagger}(E)f(E-\mu_R)\right] \,.
406: \end{equation}
407: Here the Green function $\mathbf{G}_{CC}(E)$ has all the potential shifts included,
408: \begin{equation}
409: \mathbf{G}_{CC}(E) = \left(E\mathbf{S}_{CC}-\left[\mathbf{H}_{CC}
410:  +\Delta\mathbf{H}_{C_L}+\Delta\mathbf{H}_{C_R} 
411:  +\mathbf{\Sigma}_L(E+\frac{eV_b}{2})
412:     +\mathbf{\Sigma}_R(E-\frac{eV_b}{2})\right]\right)^{-1}, \label{GccVb}
413: \end{equation}
414: \end{widetext}
415: where the $C_L$ and $C_R$ parts of $\mathbf{H}_{CC}$ are replaced by $\mathbf{h}^0_{LL}$ and $\mathbf{h}^0_{RR}$, and their potential shifts are 
416: \begin{eqnarray}
417: \left[\Delta\mathbf{H}_{C_L}\right]_{\mu\nu}&=&
418:  -\frac{eV_b}{2}
419:   \left[\mathbf{S}_{CC}\right]_{\mu\nu}, \label{equ_sh1}
420:   \forall  \mu,\nu \in C_L,\\ 
421: \left[\Delta\mathbf{H}_{C_R}\right]_{\mu\nu}&=&
422:  +\frac{eV_b}{2}
423:   \left[\mathbf{S}_{CC}\right]_{\mu\nu},
424:   \forall  \mu,\nu \in C_R \,. \label{equ_sh2}
425: \end{eqnarray}
426: The actual computational process is exactly the same as that of (\ref{equ_d3}) and Fig. 2, and self-consistency is achieved in the same way as for zero bias. Finally, in the calculation of the transmission, the potential shift present in the self-energies appears in $\Gamma_{L,R}$ as in (\ref{equ_d4}),
427: \begin{equation} \label{equ_t'}
428: T(E,V_b)=\textrm{Tr}\Big\{
429:    \mathbf{\Gamma}_L(E+\frac{eV_b}{2})\mathbf{G}_{CC}(E)
430:  \mathbf{\Gamma}_R(E-\frac{eV_b}{2})\mathbf{G}_{CC}^{\dagger}(E)\Big\}.
431:  \end{equation}
432: 
433: While the two approaches mentioned here -- the linear external potential and the potential shift in the leads -- give quite different results in the unphysical non-interacting limit, self-consistency ensures that they give the same result in physical cases. Our approach has the distinct technical advantage that the Green function and DFT calculations are completely disconnected, allowing the transport module to be easily combined with a wide variety of electronic structure codes. 
434: 
435: \subsection{Approximate non-self-consistent approaches}
436: 
437: For large systems under large bias, the full SC approach described above is computationally very difficult. The longest part of a one-time calculation is finding the surface Green functions (for all the points in the energy mesh), even with the fast renormalization method.  However, one only needs to do these calculations once and save the results. The major cost for a full self-consistent calculation is from finding $\mathbf{G}_{CC}(E)$ by Eq. (\ref{equ_Gc}) at the many energies needed for the density matrix, especially for the very fine mesh in the energy window [$\mu_{L}$, $\mu_{R}$] (see Fig. 2).
438: 
439: To avoid the complex procedure and large computational effort of full self-consistency, here we propose an approximate non-SC approach in which the bias is included by (a) applying a potential shift $-eV_b/2$ ($+eV_b/2$) to the left (right) lead through the self-energies as in Eq. (\ref{equ_t'}),
440: and (b) introducing potential shifts non-self-consistently into the region $C$. The main consideration is that it may be a good approximation to replace the change in self-consistent effective potential caused by a bias by the main features of the voltage drop. This assumes, of course, that one can guess or motivate the main features in advance. For instance, for conductive molecular devices, the bias voltage will drop mainly at the left and right contact regions if the contacts have low transparency or over the whole molecular region if the contacts are very transparent.  
441: 
442: In our approach, we introduce new parts on the left ($C_{L'}$) and right ($C_{R'}$) within the region $C$ which extend from their respective leads to the molecular contacts, as shown in Fig. 1.  We will denote by $X$ the interface between the molecule $C_{C'}$ and the regions $C_{L'}$ plus $C_{R'}$. If the voltage drop around the left (right) contact is $\alpha V_b$ ($(1-\alpha) V_b$), then the potential shifts are applied in the following way (which we call the $\Delta\mathbf{H}1$ treatment):
443: \begin{eqnarray}
444: \mathbf{H}_{CC}^{'}&=&\mathbf{H}_{CC}+\left(\alpha-\frac{1}{2}\right)eV_b \mathbf{S}_{CC}, \\
445: \left[\Delta\mathbf{H}_{C_L'}\right]_{\mu\nu}&=&-e \alpha V_b \left[
446:   \mathbf{S}_{CC}\right]_{\mu\nu}, \label{equ_shl'} 
447:   \mu \textrm{\bf \;or } \nu \in C_{L'}, \\
448: \left[\Delta\mathbf{H}_{C_R'}\right]_{\mu\nu}&=&e (1-\alpha) V_b \left[
449:   \mathbf{S}_{CC}\right]_{\mu\nu}, 
450:   \mu \textrm{\bf \;or } \nu \in C_{R'}. \label{equ_shr'} 
451: \end{eqnarray}
452: Because the potential shift is applied to a matrix element when \textit{either} orbital index is in the $C_{L',R'}$ part, the voltage drop will be slightly smeared around the interface $X$.  To explicitly show the role of $\mathbf{S}_{CC}$ in (\ref{equ_shl'}) and (\ref{equ_shr'}), we also drop the potential using (called the $\Delta\mathbf{H}0$ treatment)
453: \begin{eqnarray}
454: \left[\Delta\mathbf{H}_{C_L'}\right]_{\mu\nu}&=&-e \alpha V_b \left[
455:   \mathbf{S}_{CC}\right]_{\mu\nu}, \label{equ_shl''}
456:   \mu, \nu \in C_{L'}, \\
457: \left[\Delta\mathbf{H}_{C_R'}\right]_{\mu\nu}&=&e (1-\alpha) V_b \left[
458:   \mathbf{S}_{CC}\right]_{\mu\nu},
459:   \mu, \nu \in C_{R'}. \label{equ_shr''}
460: \end{eqnarray}
461: In either case, $\mathbf{G}_{CC}(E)$ is determined by an equation analogous to Eq. (\ref{GccVb}) in the SC case,
462: %\begin{widetext}
463: \begin{eqnarray}
464: \label{equ_Gc''}
465: \mathbf{G}_{CC}(E)&=&\left(E\mathbf{S}_{CC}-\left[\mathbf{H}_{CC}^{'}+\Delta\mathbf{H}_{C_L'}+
466:                      \Delta\mathbf{H}_{C_R'} \right. \right. \\ \nonumber 
467:                  &&+\mathbf{\Sigma}_L(E+\frac{eV_b}{2}) \left. \left.
468:                    +\mathbf{\Sigma}_R(E-\frac{eV_b}{2})\right]\right)^{-1} \,.
469: \end{eqnarray}
470: %\end{widetext}
471: The initial $\mathbf{H}_{CC}$ matrix here comes from a separate DFT calculation using a large $L$-$C$-$R$ supercell. Finally, the current is calculated as usual through Eqs. (\ref{equ_t'}) and (\ref{equ_I}).
472: 
473: So far we have considered the simplest case of low transparency contacts so that the voltage 
474: drops in only two places near the contacts.  In this simplest case the $\alpha$ parameter here 
475: has the same role as the $\eta$ parameter in Ref. \onlinecite{datta0}.
476: For a real device, the voltage drop may be much more complicated, 
477: and there may be several different voltage drops inside the device region. 
478: However, in our method all the factors affecting the voltage drop have been taken into account
479: at the DFT level, and it is straightforward 
480: to generalize the present non-SC approach for these more complicated situations (see calculations 
481: for system B in Section IV) as long as the main features of the voltage drop are known.  
482: Actually, we will show later that results of this non-SC approach are not very sensitive to the 
483: choice of the voltage drop.  If we assume that the form of the drop is not affected significantly 
484: by a change in the bias voltage itself, then the main features of the voltage drop in a system can
485: be determined by a single self-consistent calculation using a relatively small bias voltage.  
486: %Or more simply, we can guess the main features and then check by simply comparing the 
487: %transmission functions from the SC and non-SC calculations.  
488: 
489: %------------------------------ Fig. 3 -------------------------------------
490: \begin{figure}[tb] \label{fig_cells}
491: \includegraphics[angle=0,width= 8.5cm]{c7al_1.eps}
492: \includegraphics[angle=0,width= 8.5cm]{al7al.eps}
493: \caption{(Color online) Systems calculated: (a) a chain of 7 carbon atoms sandwiched between two Al leads in the (001) direction of bulk Al, (b) a chain of 7 Al atoms sandwiched between the same leads.  The C-Al distance in (a) is 1{\AA} and the C-C distance is 1.323{\AA}.  The Al-Al (chain) distance in (b) is 2.86{\AA} and the Al-Al(surface) distance is 2.025{\AA} (i.e., the two Al atoms at the ends of the chain are in their bulk positions).  The notations for different parts are the same as in Fig. 1.  The numbers 0, 1, 2, 3, 4, 5, 6 denote different interfaces, called the interface $X$, between $C_{L'}$ or $C_{R'}$ and $C_{C'}$ which are used in the present non-self-consistent approach.  }
494: \end{figure}
495: 
496: \section{Applications: Chains of Carbon or Aluminum} 
497: 
498: We report calculations of $I$-$V$ curves for two systems: a \textit{carbon chain (system A)} or an \textit{aluminum chain (system B)} sandwiched between two aluminum leads in the (001) direction of bulk Al.  The structures are shown in Fig. 3. No further atomic relaxation is performed for 
499: simplicity and for direct comparison with previous results. 
500: 
501: Our implementation of the transmission calculation is independent of the DFT part.  Therefore, it can be easily combined with any DFT package that uses a localized basis set.  As an application, here we combine it with the very efficient full DFT package SIESTA,\cite{siesta} which adopts a LCAO-like and finite-range numerical basis set and makes use of pseudopotentials for atomic cores.  
502: In our calculations we adopt a single zeta (SZ) basis set. To check the 
503: convergence of the results, we also calculate the equilibrium transmission function using a single zeta plus polarization (SZP) basis set.  The difference is found to be minor. The PBE version of the generalized gradient approximation \cite{pbe} is adopted for the electron exchange and correlation, and optimized Troullier-Martins pseudopotentials \cite{tmpp} are used for the atomic cores.  The initial density matrix of the region $C$ is obtained from a separate DFT calculation using a large $L$-$C$-$R$ supercell.
504: 
505: %------------------------------ Fig. 4 ---------------------------------------
506: \begin{figure}[tb] \label{fig_vdmap}
507: \includegraphics[angle=-90,width= 5.0cm]{c7al_vd_map_-1v.eps}
508: \includegraphics[angle=-90,width= 5.0cm]{al7al_vd_map_-1v.eps}
509: \caption{(Color online) Voltage drop in a plane going through the atomic chain for the $C_C$ region of (a) the C chain and (b) the Al chain, as shown in Fig. 3, for an applied bias of  -1 V. Note that (1) in (a) the voltage drop mainly occurs around the right contact region while in (b) it occurs over the entire chain, and (2) the oscillation in (b) is much larger than that in (a). This is because the electrons in the Al chain are more free than that in the C chain; as a result, the polarization induced by the bias is larger in the Al chain. 
510: }
511: \end{figure}
512: 
513: There are two main reasons for us to choose to study these systems.  First, the transmission function $T(E)$ of system A has been calculated by both the TranSiesta \cite{transiesta} and MCDCAL \cite{mcdcal_chain} packages using a SZ basis set.  So we can make a direct comparison to previous results.  Second, systems A and B typify two different voltage drop behaviors (although both the C and Al chains are conductive).  In system A, because the molecule-lead contact is a hetero-interface, the voltage drop will mainly occur around the two interfaces [see Fig. 4 (a), note that the voltage drops around the two interfaces are actually asymmetric].  In contrast, in system B the molecule-lead contact is a homo-interface, and furthermore the two Al atoms at the ends of the chain are at their bulk positions. So the voltage drop will occur over the entire Al chain in some way [see Fig. 4 (b)].  Our purpose is to see whether our non-SC approach can handle these different conditions.
514: 
515: 
516: %----------------------- Fig. 5 ----------------------------------------
517: \begin{figure}[bt] \label{fig_t0_pdos_a_b}
518: \includegraphics[angle=0,width=8.5cm]{t0_pdos_a_b.eps}
519: \caption{Calculated transmission function $T(E)$ (solid line) and projected density 
520: of states (PDOS, projected on the chains, dashed line) 
521: by the self-consistent approach for (a) the carbon chain (system A) and
522: (b) the aluminum chain (system B) under zero bias voltage. When $T(E)$ is significantly different from the PDOS, it means that a localized state exists at that energy. }
523: \end{figure}
524: 
525: \subsection{Transmission functions}
526: 
527: In Fig. 5 we show the calculated transmission functions and PDOS (projected on the chains) for system A and B under zero bias voltage. As it can be seen, the transmission function generally follows the PDOS except for some localized states (for instance, around 4 $\sim$ 5 eV in (a) and 1 eV in (b)) which are not coupled with the left or the right lead. Our result of $T(E)$ for system A is in very good agreement with the previous results from TranSiesta and MCDCAL packages \cite{transiesta,mcdcal_chain} (see Fig. 6 (a) of Ref. \onlinecite{transiesta}).
528: 
529: 
530: %----------------------------- Fig. 6-------------------------------------
531: \begin{figure}[tb] \label{fig_t_c7al_1v}
532: \includegraphics[angle=0,width=8.5cm]{t_c7al_1v_1.eps}
533: \caption{Transmission function $T(E)$ for the carbon chain (system A) under
534: 1 V bias ($\alpha=0.5$) from the SC approach and the non-SC  
535: approach ($\Delta\mathbf{H}1$ treatment) with different choices 
536: for the interface $X$ between $C_{C'}$ and the $C_{L'}$ or
537: $C_{R'}$ parts (denoted by `1', ..., `5')
538: as shown in Fig. 3 (a). Note the similarity between the different non-SC results and the fully SC result once $X$ is placed in the contact regions ($\ge 3$). }
539: \end{figure}
540: 
541: 
542: In Fig. 6 we show, for system A under a bias of 1.0V ($\alpha=0.5$), the calculated $T(E)$ by the SC approach and the non-SC approach ($\Delta\mathbf{H}1$ treatment) with different choices for the interface $X$ between the $C_{L'}$ or $C_{R'}$ and $C_{C'}$ parts.  The first thing we note is that our SC result for 1V bias is in very good agreement with the previous self-consistent results \cite{transiesta,mcdcal_chain} (see Fig. 6 of Ref. \onlinecite{transiesta}).  When we change the interface $X$ (denoted by the different numbers in Fig. 6) from deep in the leads to the contact regions (i.e., interface $X$ = 1 $\rightarrow$ 2 $\rightarrow$ 3), the non-SC result varies significantly. However, as $X$ moves into the contact regions (i.e., interface $X$ = 3, or 4, or 5), the result becomes very close to the SC result and insensitive to the exact position of $X$.  This result is just what we expect because in system A the bias voltage drops mainly around the hetero-interface contact regions. This can be regarded as an advantage of the present non-SC approach:its result is not strongly dependent on the technical choice.
543: 
544: 
545: %-------------------------------- Fig. 7 ----------------------------------
546: \begin{figure}[tb] \label{fig_t_al7al_1v}
547: \includegraphics[angle=0,width=8.5cm]{t_al7al_1v_1.eps}
548: \caption{Calculated transmission function $T(E)$ for the Al chain system under
549: 1 V bias ($\alpha=0.5$, $\Delta\mathbf{H}1$ treatment for non-SC) by the SC and non-SC approaches.
550: %: present self-consistent (SC) approach,
551: %existing non-self-consistent (non-SC) approach, and present non-self-consistent
552: %approach with different chooses of the interface between the $C_{L'}$ or
553: %$C_{R'}$ and $C_{C'}$ regions (denoted by `1', `2', `3', `4', and `5')
554: %as shown in Fig. \ref{fig_cells} (b).
555: The notations are similar to those in Fig. 6}
556: \end{figure}
557: 
558: Similar calculations of $T(E)$ for system B under $V_b$ = 1.0V ($\alpha=0.5$, $\Delta\mathbf{H}1$ treatment for non-SC) are shown in Fig. 7.  Again, after the interface $X$ is moved into the contact regions the different non-SC result become quite close.  However, compared to the case of system A, the agreement with the SC result is not good, especially in the energy range around the (averaged) Fermi level: the transmission from the non-SC calculations (except for interface $X$ = 5) is noticeably larger than that from the SC calculation. This substantial disagreement originates in the difference between the self-consistent effective potential [Fig. 4(b)] and the non-self-consistent one assumed in the non-SC approach.
559: 
560: 
561: %-------------------------------- Fig. 8 -------------------------------------
562: \begin{figure}[tb] \label{fig_didv_a_vd}
563: \includegraphics[angle=0,width=8.5cm]{didv_a_vd.eps}
564: \caption{(Color online) Calculated $I$-$V$ curves for the C chain system by the SC approach and the non-SC approach with interface $X$ = 3 [as shown in Fig. 3 (a)].  Three different voltage drops are considered: $\alpha$ = 0.2, 0.5, 0.8.  The $\Delta\mathbf{H}0$ and $\Delta\mathbf{H}1$ treatments are adopted for $\alpha$ = 0.8. The simple non-SC approach does very well using $\alpha=0.5$ or $0.8$ and the $\Delta\mathbf{H}1$ treatment.
565: }
566: \end{figure}
567: 
568: 
569: \subsection{$I$-$V$ curve of system A}
570: %effect of different voltage drops, $\Delta\mathbf{H}0$ and $\Delta\mathbf{H}1$ treatments}
571: 
572: In order to show effects of different voltage drops and the difference between the $\Delta\mathbf{H}0$ and $\Delta\mathbf{H}1$ treatments [see Eqs. (\ref{equ_shl'})-(\ref{equ_shr''})] for potential shifting, we give in Fig. 8 the calculated $I$-$V$ curves for system A from the non-SC approach with interface $X$ = 3 compared to the SC result. We do the non-SC calculations for three different voltage drops ($\alpha$ = 0.2, 0.5, 0.8) for the $\Delta\mathbf{H}1$ treatment.  In addition, for $\alpha$ = 0.8 we do the non-SC calculation with the $\Delta\mathbf{H}0$ treatment.  Among the three different voltage drops, the result for $\alpha$ = 0.2  is in poor agreement with the SC result while those for $\alpha$ = 0.5 and 0.8 are in good agreement.  This makes sense in view of the main features of the voltage drop in Fig. 4(a): the bias voltage will mainly drop around the left (right) contact for a positive (negative) bias.  However, the small difference in $I$-$V$ curve between $\alpha$ = 0.5 and 0.8 indicates that the $I$-$V$ characteristics is actually not sensitive to the exact change in voltage drop.  By comparing the SC result and the two non-SC results for $\alpha$ = 0.8, we see that the $\Delta\mathbf{H}1$ treatment improves the result markedly. This is understandable because voltage drops in real physical systems are not sharp step-functions but are somewhat smeared out.
573: 
574: 
575: %-------------------------------- Fig. 9 -------------------------------------
576: \begin{figure}[b] \label{fig_didv_a}
577: \includegraphics[angle=0,width=8.5cm]{didv_a_vd55_2.eps}
578: \caption{(Color online) Calculated $I$-$V$ curves for the C chain system by the SC approach and the non-SC approach with different interface $X$ indicated by the numbers. $\alpha$ = 0.5 and $\Delta\mathbf{H}1$ treatment are adopted for the non-SC calculations.
579: %the non-self-consistent approach in recent literatures; the numbers %`1', `2', `3', `4', `5' indicate the results from the present %non-self-consistent approach using different interfaces between %the $C{L'}$ or $C_{R'}$ and $C_{C'}$ regions, as shown in Fig. \ref{fig_cells} (a).
580: }
581: \end{figure}
582: 
583: 
584: %--------------------------------- Fig. 10 --------------------------------
585: \begin{figure}[tb] \label{fig_didv_b}
586: \includegraphics[angle=0,width=8.5cm]{didv_b_2.eps}
587: \caption{(Color online) Calculated $I$-$V$ curves for the Al chain system by the SC and non-SC approaches.  The notations are similar to those in Fig. 9.  `2+3+4+5+6' means a combined interface $X$ (see Fig. 3 (b) ) and each interface bears 1/10 of a bias voltage (see Section III C).}
588: \end{figure}
589: 
590: 
591: \subsection{$I$-$V$ curves of systems A and B: comparison of different approaches}
592: 
593: $I$-$V$ curves for systems A and B calculated by different approaches are given in Figs. 9 and 10, respectively. For the non-SC approach, $\alpha$ = 0.5 and the $\Delta\mathbf{H}1$ treatment are always adopted. It turns out that for small bias voltages ($V_b$ $<$ 0.3V) all the different treatments (the SC, the non-SC with different interface $X$), all give similar results, i.e., the effect of the different voltage drops is very small. Along with the increase of bias voltage, the difference among the different calculations becomes more and more significant. 
594: 
595: For the carbon chain (system A), because the molecule-electrode contact is a hetero-interface and therefore the voltage drop occurs mainly around the contact regions, the SC result and the non-SC results with the interface $X$ located around the two contact regions are in good agreement. As for the transmission function in Fig. 6, once the interface $X$ is within the contact regions the result is quite insensitive to the technical choice. This indicates that for systems made of conductive molecular junctions coupled with metallic electrodes through hetero-interfaces, the present non-SC approach works quite well and can give nearly \textit{quantitatively} correct answers.
596: 
597: For the aluminum chain (system B) the molecule-electrode contact is a homo-interface and the two Al atoms at the ends of the chain are at their bulk positions; therefore, the voltage drop is not localized around the contact regions. Consequently, the result from the non-SC approach with the interface $X$ located around the contact region is not in good agreement with the SC result.  In order to further verify our analysis for system B, we generalize the present non-SC approach for a voltage drip occurring at multiple points: We use a combined interface $X$ = 2+3+4+5+6 in which each layer bears a voltage drop of $V_b$/10. Because of the role played by the overlap matrix in Eqs. (\ref{equ_shl'}) and (\ref{equ_shr'}), the resulting voltage drop will occur over the entire device region. The calculated $I$-$V$ curve by the generalized non-SC approach is given in Fig. 10 by a violet solid line.  The overall agreement with the SC result is remarkably improved, indicating that our analysis is reasonable. 
598: 
599: 
600: \subsection{Limitation of the present approaches}
601: 
602: For finishing the discussion we would like to point out the cases where the present method
603: will not work. Obviously, the present method is only valid for steady state coherent electron 
604: transport through metal-molecule-metal systems; there are basically two cases
605: where our method does not work: (1) electron transport in Coulomb
606: blockade regime, for both the
607: SC and non-SC approaches, and (2) cases where the main feature of voltage drop
608: is sensitive to the value of the bias voltage itself, for the non-SC approach.
609: In the first case, the contact barrier is so high that the molecule
610: and the leads are essentially separated, and as a result, the molecular
611: chemical potential is generally different from the Fermi energies of the
612: leads even under zero bias.
613: Because in our DFT+NEGF approach there is only
614: one Fermi energy under zero bias, it will fail in this case.
615: The second case is just the opposite to that assumed in our
616: non-SC approach. We don't know at this moment what systems will have this
617: behavior or whether such kind of systems exist. But this can be easily checked
618: by doing selfconsistent calculations for several different bias voltages
619: within the bias range interested.
620: 
621: 
622: \section{Summary}
623: 
624: A full self-consistent DFT-electronic-structure-based Green function method has been proposed and implemented for electron transport from molecular devices.  Our method is simple and straightforward while strict.  The implementation is very independent of the DFT electronic structure part; it can be easily combined with any electronic structure package using a localized basis set.  In an effort to avoid the extremely burdensome computational cost for large systems or for $I$-$V$ characteristic analysis, we developed an approximate non-self-consistent approach in which the change in effective potential caused by a bias in the device region of a system is approximated by the main features of the voltage drop.  
625: 
626: As applications of our methods, we calculated the $I$-$V$ curves for two different systems with 
627: different typical voltage drops: a carbon chain and a aluminum chain sandwiched between two 
628: aluminum electrodes.  Our self-consistent results are in very good agreement with those from 
629: other calculations. For both systems the present non-SC approach can give results in good 
630: agreement with the self-consistent results, indicating that it is a good approximate method 
631: with high efficiency for $I$-$V$ characteristic analysis \cite{tddft}
632: (more then one order of magnitude 
633: faster for moderate systems). It is straightforward to generalize this non-SC approach to 
634: deal with any kind of voltage drop situation.
635: 
636: %Our numerical calculations have shown that, overall, the $I$-$V$ characteristics %of a molecular device is actually a quite insensitive quantity to the %change of voltage drop inside the device region. 
637: 
638: 
639: %\acknowledgments
640: This work was supported in part by the NSF (DMR-0103003).
641:  
642: \begin{thebibliography}{99}
643: 
644: \bibitem{mol1}      {\it An Introduction to Molecular Electronics}, 
645:                     edited by M.C. Petty and M. Bryce (Oxford University Press, New York, 1995).
646: \bibitem{mol2}      J.R. Heath and M.A. Ratner, Physics Today {\bf 56} (5), 43 (2003).
647: \bibitem{mol3}      {\it Molecular Electronics}, by James M. Tour 
648:                     (World Scientific Publishing Co. Pte. Ltd., 2003)
649: \bibitem{mol4}      C. Joachim, J.K. Gimzewski, and A. Aviram, 
650:                     Nature (London) {\bf 408}, 541 (2000).
651: \bibitem{mol5}      A. Nitzan, Annu. Rev. Phys. Chem. {\bf 52}, 681 (2001).
652: 
653: \bibitem{dft1}      W. Kohn and L.J. Sham, Phys. Rev. {\bf 140}, A1133 (1965). 
654: \bibitem{dft2}      R.G. Parr, W. Yang, {\it Density-Functional Theory of Atoms and Molecules}
655:                     (Oxford University Press, New York, 1989i).
656: 
657: \bibitem{lang1}     N.D. Lang, Phys. Rev. B {\bf 52}, 5335 (1995).
658: \bibitem{lang2}     N.D. Lang, Ph. Avouris, Phys. Rev. Lett. {\bf 84}, 358 (2000).
659: \bibitem{lang3}     M. Di Ventra, S.T. Pantelides, N.D. Lang, Phys. Rev. Lett. {\bf 84}, 979 (2000).
660: \bibitem{lang4}     N.D. Lang, Ph. Avouris, Phys. Rev. B {\bf 64}, 125323 (2001).
661: 
662: \bibitem{landauer1} R. Landauer, Philos. Mag. {\bf 21}, 863 (1970).
663: %\bibitem{landauer2} M. B\:{u}ttiker, Y. Imry, R. Laudauer, S. Pinhas, 
664: %                    Phys. Rev. B {\bf 31}, 6207 (1985).
665: 
666: \bibitem{test}      For instance, our test calculation for benzene molecule 
667:                     sandwiched between two Au electrodes
668:                     showed that the molecule-electrode charge transfer depends significantly
669:                     on the adsorption site of the molecule on the Au surface. 
670: 
671: \bibitem{negf1}     H. Haug, A.-P. Jauho, {\it Quantum Kinetics in Transport, Optics of 
672:                     Semiconductors} (Springer-Verlag, Berlin, 1996). 
673: \bibitem{negf2}     S. Datta, in {\it Electronic Transport in Mesoscopic Systems},
674:                     edited by H. Ahmed, M. Pepper, and A. Broers (Cambridge University Press,
675:                     Cambridge, England, 1995).
676: 
677: %\bibitem{implems}
678: \bibitem{datta1}    P.S. Damle, A.W. Ghosh, and S. Datta, Phys. Rev. B {\bf 64}, 201403 (2001).
679: \bibitem{datta2}    Y. Xue, S. Datta, and M.A. Ratner, Chem. Phys. {\bf 281}, 151 (2002).
680: 
681: %\bibitem{cals}
682: 
683: \bibitem{transiesta} M. Brandbyge, J.-L. Mozos, P. Ordej\'{o}n, J. Taylor, and K. Stokbro,
684:                      Phys. Rev. B {\bf 65}, 165401 (2002).
685: 
686: \bibitem{mcdcal}     J. Taylor, H. Guo, and J. Wang, Phys. Rev. B {\bf 63}, 245407 (2001).
687: 
688: \bibitem{mono}       C. Krzeminski, C. Delerue, G. Allan, D. Vuillaume, and R.M. Metzger,
689:                      Phys. Rev. B {\bf 64}, 085405 (2001).
690: 
691: %\bibitem{transiesta}
692: \bibitem{transiesta1}J. Taylor, M. Brandbyge, and K. Stokbro, 
693:                          Phys. Rev. B {\bf 68}, 121101 (2003);
694:                      J. Taylor, M. Brandbyge, and K. Stokbro, 
695:                          Phys. Rev. Lett. {\bf 89}, 138301 (2002);
696:                      S. K. Nielsen, M. Brandbyge, K. Hansen, K. Stokbro, J. M. van Ruitenbeek, 
697:                          and F. Besenbacher, Phys. Rev. Lett. {\bf 89}, 066804 (2002).
698: 
699: \bibitem{mcdcal1}    W. Lu, E. G. Wang, and H. Guo,
700:                          Phys. Rev. B {\bf 68}, 075407 (2003);
701:                      C.-C. Kaun, B. Larade, and H. Guo,
702:                          Phys. Rev. B {\bf 67}, 121411 (2003);
703:                      C.-C. Kaun, B. Larade, H. Mehrez, J. Taylor, and H. Guo,
704:                          Phys. Rev. B {\bf 65}, 205416 (2002);
705:                      H. Mehrez, A. Wlasenko, B. Larade, J. Taylor, P. Grtter, and H. Guo,
706:                          Phys. Rev. B {\bf 65}, 195419 (2002);
707:                      C. Roland, B. Larade, J. Taylor, and H. Guo,
708:                          Phys. Rev. B {\bf 65}, 041401 (2002).
709: \bibitem{xue}        Y. Xue, S. Datta, and M. A. Ratner,
710:                          J. Chem. Phys. {\bf 115}, 4292 (2001);
711:                      Y. Xue and M. A. Ratner,
712:                          Phys. Rev. B {\bf 68}, 115407 (2003);
713:                      Y. Xue and M. A. Ratner,
714:                         Phys. Rev. B {\bf 68}, 115406 (2003);
715:                      R. Venugopal, Z. Ren, S. Datta, M. S. Lundstrom, and D. Jovanovic,
716:                         J. Appl. Phys. {\bf 92}, 3730 (2002).
717: 
718: \bibitem{non-sc}     for example, J. Heurich, J.C. Cuevas, W. Wenzel, and G. Sch\:{o}n,
719:                      Phys. Rev. Lett. {\bf 88}, 256803 (2002).
720: 
721: \bibitem{datta0}     S. Datta, W. Tian, S. Hong, R. Reifenberger, J.I. Henderson, and C.P. Kubiak,
722:                      Phys. Rev. Lett. {\bf 79}, 2530 (1997).
723: 
724: \bibitem{siesta}     J.M. Soler, E. Artacho, J.D. Gale, A. Garcia, J. Junquera, P. Ordej\'on,
725:                      and D. S\'anchez-Portal, J. Phys. C {\bf 14}, 2745 (2002).
726: 
727: \bibitem{renorm}     M.P. L\'{o}pez Sancho, J.M. L\'{o}pez Sancho, and J. Rubio,
728:                      J. Phys. F: Met. Phys. {\bf 15}, 851 (1985).
729: 
730: \bibitem{negf_i}     Y. Meir, N.S. Wingreen, Phys. Rev. Lett. {\bf 68}, 2512 (1992).
731: \bibitem{negf_i2}    R. Lake, G. Klimeck, R.C. Bowen, and D. Jovanovic,
732:                      J. Appl. Phys. {\bf 81}, 7845 (1997).
733:  
734: \bibitem{pbe}        J.P. Perdew, K. Burke, and M. Ernzerhof,
735:                      Phys. Rev. Lett. {\bf 77}, 3865 (1996).
736: 
737: \bibitem{tmpp}       N. Troullier and J.L. Martins, Phys. Rev. B {\bf 43}, 1993 (1991).
738: 
739: \bibitem{mcdcal_chain} B. Larade, J. Taylor, H. Mehrez, and H. Guo, Phys. Rev. B {\bf 64},
740:                        075420 (2001).
741: 
742: \bibitem{tddft} After we submitted our paper, we noticed a very recent theoretical work
743: (G. Stefanucci and C.-O. Almbladh, cond-mat/0310084) in which, by building up a formally exact 
744: NEGF+TDDFT (abbreviation for time dependent DFT) scheme, the authors show that the steady state 
745: current in the long-time limit depends only on the asymptotic shape of the one-particle 
746: effective potential of TDDFT.
747: Our numerical result from the non-SC approach is just consistent with this theoretical finding.
748: 
749: 
750: \end{thebibliography}
751: 
752: \end{document}
753: