We initiate the study of quantum agnostic learning of phase states with respect to a function class C⊆{c:{0,1}n→{0,1}}: given copies of an unknown n-qubit state ∣ψ⟩ which has fidelity opt with a phase state ∣ϕc⟩=2n1∑x∈{0,1}n(−1)c(x)∣x⟩ for some c∈C, output ∣ϕ⟩ which has fidelity ∣⟨ϕ∣ψ⟩∣2≥opt−ε. To this end, we give agnostic learning protocols for the following classes: (i) Size-t decision trees which runs in time poly(n,t,1/ε). This also implies k-juntas can be agnostically learned in time poly(n,2k,1/ε). (ii) s-term DNF formulas in time poly(n,(s/ε)loglog(s/ε)⋅log(1/ε)).Our main technical contribution is a quantum agnostic boosting protocol which converts a weak agnostic learner, which outputs a parity state ∣ϕ⟩ such that ∣⟨ϕ∣ψ⟩∣2≥opt/poly(n), into a strong learner which outputs a superposition of parity states ∣ϕ′⟩ such that ∣⟨ϕ′∣ψ⟩∣2≥opt−ε.Using quantum agnostic boosting, we obtain a nO(log(n/ε)⋅loglogn)-time algorithm for ε-learning poly(n)-sized depth-3 circuits (consisting of AND, OR, NOT gates) in the uniform PAC model given quantum examples. Classically, obtaining an algorithm with a similar complexity has been an open question in the PAC model and our work answers this given quantum examples.