The Lightning Network (LN) has emerged as a second-layer solution to
Bitcoin's scalability challenges. The rise of Payment Channel Networks (PCNs)
and their specific mechanisms incentivize individuals to join the network for
profit-making opportunities. According to the latest statistics, the total
value locked within the Lightning Network is approximately \500million.Meanwhile,joiningtheLNwiththeprofit−makingincentivespresentsseveralobstacles,asitinvolvessolvingacomplexcombinatorialproblemthatencompassesbothdiscreteandcontinuouscontrolvariablesrelatedtonodeselectionandresourceallocation,respectively.CurrentresearchinadequatelycapturesthecriticalroleofresourceallocationandlacksrealisticsimulationsoftheLNroutingmechanism.Inthispaper,weproposeaDeepReinforcementLearning(DRL)framework,enhancedbythepoweroftransformers,toaddresstheJointCombinatorialNodeSelectionandResourceAllocation(JCNSRA)problem.Wehaveimproveduponanexistingenvironmentbyintroducingmodulesthatenhanceitsroutingmechanism,therebynarrowingthegapwiththeactualLNroutingsystemandensuringcompatibilitywiththeJCNSRAproblem.Wecompareourmodelagainstseveralbaselinesandheuristics,demonstratingitssuperiorperformanceacrossvarioussettings.Additionally,weaddressconcernsregardingcentralizationintheLNbydeployingouragentwithinthenetworkandmonitoringthecentralitymeasuresoftheevolvedgraph.OurfindingssuggestnotonlyanabsenceofconflictbetweenLN′sdecentralizationgoalsandindividuals′revenue−maximizationincentivesbutalsoapositiveassociationbetweenthetwo.